kubaflo · 2026-02-10T20:32:01Z

Agent Workflow Labels

GitHub labels for tracking outcomes of the AI agent PR review workflow (Review-PR.ps1).

All labels use the s/agent-* prefix for easy querying on GitHub.

Label Categories

Outcome Labels

Mutually exclusive — exactly one is applied per PR review run.

Label	Color	Description	Applied When
`s/agent-approved`	🟢 `#2E7D32`	AI agent recommends approval — PR fix is correct and optimal	Report phase recommends APPROVE
`s/agent-changes-requested`	🟠 `#E65100`	AI agent recommends changes — found a better alternative or issues	Report phase recommends REQUEST CHANGES
`s/agent-review-incomplete`	🔴 `#B71C1C`	AI agent could not complete all phases (blocker, timeout, error)	Agent exits without completing all phases

When a new outcome label is applied, any previously applied outcome label is automatically removed.
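The replace-on-apply rule can be sketched as a small pure function that plans which labels to add and remove. This is an illustrative Python sketch, not the PowerShell in Update-AgentLabels.ps1; the function name `plan_outcome_change` and the returned tuple shape are assumptions.

```python
# The three outcome labels from the table above.
OUTCOME_LABELS = {
    "s/agent-approved",
    "s/agent-changes-requested",
    "s/agent-review-incomplete",
}


def plan_outcome_change(current_labels, new_outcome):
    """Return (to_add, to_remove) so that exactly one outcome label remains."""
    if new_outcome not in OUTCOME_LABELS:
        raise ValueError(f"not an outcome label: {new_outcome}")
    current = set(current_labels)
    # Any other outcome label already on the PR must go.
    to_remove = sorted((current & OUTCOME_LABELS) - {new_outcome})
    # Skip the add when the label is already present (idempotent re-run).
    to_add = [] if new_outcome in current else [new_outcome]
    return to_add, to_remove
```

Labels outside the outcome set are never touched, so unrelated labels on the PR survive a re-run.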

Signal Labels

Additive — multiple can coexist on a single PR.

Label	Color	Description	Applied When
`s/agent-gate-passed`	🟢 `#4CAF50`	AI verified tests catch the bug (fail without fix, pass with fix)	Gate phase passes
`s/agent-gate-failed`	🟠 `#FF9800`	AI could not verify tests catch the bug	Gate phase fails
`s/agent-fix-win`	🟢 `#66BB6A`	AI found a better alternative fix than the PR	Fix phase: alternative selected over PR's fix
`s/agent-fix-pr-picked`	🟠 `#FF7043`	AI could not beat the PR fix — PR is the best among all candidates	Fix phase: PR selected as best after comparison

Gate labels (gate-passed/gate-failed) are mutually exclusive with each other. Fix labels (fix-win/fix-pr-picked) are mutually exclusive with each other.
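A minimal Python sketch of the per-pair exclusivity follows. It is not the PowerShell implementation. The function name is hypothetical, and leaving a pair untouched when its phase was skipped is an assumption. The "PR fix was best" label appears in this PR under two names, s/agent-fix-lose and s/agent-fix-pr-picked; the sketch uses the latter, which the review comments report as the name defined in the code.

```python
# Result value -> label, one mapping per mutually exclusive pair.
GATE = {"passed": "s/agent-gate-passed", "failed": "s/agent-gate-failed"}
FIX = {"win": "s/agent-fix-win", "lose": "s/agent-fix-pr-picked"}


def plan_signal_labels(current_labels, gate_result=None, fix_result=None):
    """Return (to_add, to_remove) for the gate and fix signal labels."""
    current = set(current_labels)
    to_add, to_remove = [], []
    for mapping, result in ((GATE, gate_result), (FIX, fix_result)):
        if result is None:
            continue  # phase skipped: leave this pair as it is (assumption)
        wanted = mapping[result]
        if wanted not in current:
            to_add.append(wanted)
        # Remove the counterpart from the same pair if it is present.
        to_remove += sorted(
            label for label in mapping.values()
            if label != wanted and label in current
        )
    return to_add, to_remove
```

Because the two pairs are handled independently, a gate label and a fix label can coexist while each pair still holds at most one label.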

Tracking Label

Always applied on every completed agent run.

Label	Color	Description	Applied When
`s/agent-reviewed`	🔵 `#1565C0`	PR was reviewed by AI agent workflow (full 4-phase review)	Every completed agent run

Manual Label

Applied by MAUI maintainers, not by automation.

Label	Color	Description	Applied When
`s/agent-fix-implemented`	🟣 `#7B1FA2`	PR author implemented the agent's suggested fix	Maintainer applies when PR author adopts agent's recommendation
`s/agent-suggestions-implemented`	🟣 `#7B1FA2`	PR author implemented the agent's code suggestions	Maintainer applies when PR author adopts agent's recommendation

How It Works

Architecture

Review-PR.ps1
├── Phase 1: PR Agent Review (Copilot CLI)
│   ├── Pre-Flight → writes content.md
│   ├── Gate       → writes content.md
│   ├── Fix        → writes content.md
│   └── Report     → writes content.md
├── Phase 2: PR Finalize (optional)
├── Phase 3: Post Comments (optional)
└── Phase 4: Apply Labels  ← labels are applied here
    ├── Parse content.md files
    ├── Determine outcome + signal labels
    ├── Apply via GitHub REST API
    └── Non-fatal: errors warn but don't fail the workflow

Labels are applied exclusively from Review-PR.ps1 Phase 4. No other script applies agent labels. This single-source design avoids label conflicts and simplifies debugging.

How Labels Are Parsed

The Parse-PhaseOutcomes function in Update-AgentLabels.ps1 reads content.md files from each phase directory:

Source File	What's Parsed	Resulting Label
`gate/content.md`	`Result: ✅ PASSED`	`s/agent-gate-passed`
`gate/content.md`	`Result: ❌ FAILED`	`s/agent-gate-failed`
`try-fix/content.md`	`Selected Fix: Candidate ...`	`s/agent-fix-win`
`try-fix/content.md`	`Selected Fix: PR ...`	`s/agent-fix-pr-picked`
`report/content.md`	`Final Recommendation: APPROVE`	`s/agent-approved`
`report/content.md`	`Final Recommendation: REQUEST CHANGES`	`s/agent-changes-requested`
(missing report)	No report file exists	`s/agent-review-incomplete`
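The mapping in this table can be sketched in Python as follows. This is not the actual Parse-PhaseOutcomes code: the regular expressions follow the table literally, the result values ('passed', 'win', 'lose' and so on) mirror the parameter documentation quoted in the review, and treating an unrecognised report as incomplete is an assumption.

```python
import re
from pathlib import Path


def parse_phase_outcomes(pr_agent_dir):
    """Map the phase content.md files to outcome, gate and fix results."""
    root = Path(pr_agent_dir)

    def read(phase):
        path = root / phase / "content.md"
        return path.read_text(encoding="utf-8") if path.is_file() else None

    # Default covers the "missing report" row of the table.
    result = {"outcome": "review-incomplete", "gate": None, "fix": None}

    gate = read("gate")
    if gate:
        if re.search(r"Result:\s*✅\s*PASSED", gate):
            result["gate"] = "passed"
        elif re.search(r"Result:\s*❌\s*FAILED", gate):
            result["gate"] = "failed"

    fix = read("try-fix")
    if fix:
        match = re.search(r"Selected Fix:\s*(Candidate|PR)\b", fix)
        if match:
            result["fix"] = "win" if match.group(1) == "Candidate" else "lose"

    report = read("report")
    if report:
        if re.search(r"Final\s+Recommendation:\s*APPROVE", report, re.I):
            result["outcome"] = "approved"
        elif re.search(r"Final\s+Recommendation:\s*REQUEST\s+CHANGES", report, re.I):
            result["outcome"] = "changes-requested"
    return result
```

A missing gate or try-fix file leaves the corresponding result as `None`, which a caller can treat as "phase skipped".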

Self-Bootstrapping

Labels are created automatically on first use via Ensure-LabelExists. No manual setup required. If a label already exists but has a stale description or color, it is updated.
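The create-or-update decision can be sketched as a pure function, shown here in Python rather than the script's PowerShell. The function name and the 'create' / 'update' / 'noop' return values are assumptions for illustration. The colour comparison ignores a leading `#` and letter case, on the assumption that the repository may report colours in a different form than the tables above.

```python
def plan_label_sync(existing, color, description):
    """Decide what an ensure-label step should do.

    `existing` is the label as currently stored in the repository (a dict
    with 'color' and 'description'), or None if the label does not exist.
    """
    if existing is None:
        return "create"
    want_color = color.lstrip("#").lower()
    have_color = (existing.get("color") or "").lstrip("#").lower()
    have_description = existing.get("description") or ""
    if have_color != want_color or have_description != description:
        return "update"  # stale colour or description
    return "noop"
```

With the GitHub REST API, 'create' would correspond to `POST /repos/{owner}/{repo}/labels` and 'update' to `PATCH /repos/{owner}/{repo}/labels/{name}`; how the real script issues these calls is not shown in this PR text.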

Querying Labels

All labels use the s/agent-* prefix, making them easy to filter on GitHub.

Common Queries

# PRs the agent approved
is:pr label:s/agent-approved

# PRs where agent found a better fix
is:pr label:s/agent-fix-win

# PRs where agent found better fix AND author implemented it
is:pr label:s/agent-changes-requested label:s/agent-fix-implemented

# PRs where tests don't catch the bug
is:pr label:s/agent-gate-failed

# Agent-reviewed PRs that are still open
is:pr is:open label:s/agent-reviewed

# All agent-reviewed PRs (total count)
is:pr label:s/agent-reviewed

Metrics You Can Derive

Metric	Query
Total agent reviews	`is:pr label:s/agent-reviewed`
Approval rate	Compare `label:s/agent-approved` vs `label:s/agent-changes-requested` counts
Gate pass rate	Compare `label:s/agent-gate-passed` vs `label:s/agent-gate-failed` counts
Fix win rate	Compare `label:s/agent-fix-win` vs `label:s/agent-fix-pr-picked` counts
Agent adoption rate	`label:s/agent-fix-implemented` / `label:s/agent-changes-requested`
Incomplete review rate	`label:s/agent-review-incomplete` / `label:s/agent-reviewed`
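Once the per-label PR counts have been collected (for example by running the queries above), a few of these metrics reduce to simple ratios. The Python sketch below is illustrative only. Reading "compare X vs Y" as X / (X + Y) is an assumption, and the function names are hypothetical.

```python
def rate(numerator, denominator):
    """Return numerator / denominator, or None when the denominator is zero."""
    return numerator / denominator if denominator else None


def derive_metrics(counts):
    """`counts` maps a label name to the number of PRs carrying it."""
    def c(suffix):
        return counts.get(f"s/agent-{suffix}", 0)

    return {
        "approval_rate": rate(c("approved"), c("approved") + c("changes-requested")),
        "gate_pass_rate": rate(c("gate-passed"), c("gate-passed") + c("gate-failed")),
        "adoption_rate": rate(c("fix-implemented"), c("changes-requested")),
        "incomplete_rate": rate(c("review-incomplete"), c("reviewed")),
    }
```

Returning `None` for an empty denominator keeps "no data yet" distinguishable from a genuine rate of zero.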

Implementation Details

Files

File	Purpose
`.github/scripts/shared/Update-AgentLabels.ps1`	Label helper module (all label logic)
`.github/scripts/Review-PR.ps1`	Orchestrator that calls `Apply-AgentLabels` in Phase 4
`.github/agents/pr/SHARED-RULES.md`	Documents label system for the PR agent

Key Functions

Function	Description
`Apply-AgentLabels`	Main entry point — parses phases and applies all labels
`Parse-PhaseOutcomes`	Reads `content.md` files, returns outcome/gate/fix results
`Update-AgentOutcomeLabel`	Applies one outcome label, removes conflicting ones
`Update-AgentSignalLabels`	Adds/removes gate and fix signal labels
`Update-AgentReviewedLabel`	Ensures tracking label is present
`Ensure-LabelExists`	Creates or updates a label in the repository

Design Principles

Idempotent: Safe to re-run — checks before add/remove, GitHub ignores duplicate adds
Non-fatal: Label failures emit warnings but never fail the overall workflow
Single source: All labels applied from Review-PR.ps1 only — no other scripts touch labels
Self-bootstrapping: Labels are created on first use via GitHub API
Mutual exclusivity enforced: Outcome labels and same-category signal labels automatically remove their counterpart
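The non-fatal principle amounts to wrapping each labeling step so that a failure becomes a warning. A minimal Python sketch, with a hypothetical helper name and not taken from the PowerShell source:

```python
import warnings


def apply_labels_non_fatal(apply_fn, *args, **kwargs):
    """Run a labeling step; report a failure as a warning and carry on."""
    try:
        apply_fn(*args, **kwargs)
        return True
    except Exception as exc:  # deliberately broad: labels must not fail the run
        warnings.warn(f"label step failed: {exc}")
        return False
```

The boolean return value lets the caller log which steps were skipped without changing the workflow's exit status.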

Migrated From

The following old infrastructure was removed as part of this implementation:

Update-VerificationLabels function in verify-tests-fail.ps1 — removed (labels now come from Review-PR.ps1 only)
s/ai-reproduction-confirmed / s/ai-reproduction-failed labels — superseded by s/agent-gate-passed / s/agent-gate-failed

Clarify the Copilot PR-review prompt to execute five explicit phases (Understanding, Test Review, Fix Exploration, Alternative Comparison, Final Review). Add a new pipeline step that invokes a Copilot "post-comment" skill, captures its exit code, logs output to $(Build.ArtifactStagingDirectory)/copilot-logs, and sets a PostCommentFailed variable on failure. Ensure artifacts dir exists, surface warnings on failure, and make the original post-comment fallback step run only when the skill step failed. Update step display names and preserve artifact publishing.

Add a pipeline step (Cache Prompt File) that loads eng/pipelines/prompts/pr-review-prompt.md and copies it to /tmp/copilot-prompts/pr-review-prompt.md before the PR branch checkout. Update the later Copilot step to read the prompt from the cached location and adjust the error message. This prevents failures when the prompt file is absent on the PR branch by ensuring a stable copy is available for the review step.

Replace brittle iPhone Xs + iOS 18.5 lookup with a prioritized selection routine. The script now iterates preferred iOS versions and device models, picks the first available preferred device or falls back to the first available iPhone, and finally falls back to any available simulator. It also surfaces runtime info when listing available simulators and reports the selected device name and UDID before booting.
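The three-level fallback described in this commit can be sketched as below. This is a Python illustration, not the script's PowerShell; the simulator dictionaries with 'name', 'udid' and 'ios' keys are an assumed shape.

```python
def select_simulator(available, preferred_versions, preferred_models):
    """Pick a simulator by priority, or return None if there is none."""
    # 1. First match over preferred iOS major versions, then preferred models.
    for version in preferred_versions:
        for model in preferred_models:
            for sim in available:
                if sim["ios"].split(".")[0] == version and sim["name"] == model:
                    return sim
    # 2. Otherwise the first available iPhone.
    for sim in available:
        if sim["name"].startswith("iPhone"):
            return sim
    # 3. Otherwise any available simulator.
    return available[0] if available else None
```

For example, with `preferred_versions=["18", "17"]` and `preferred_models=["iPhone Xs"]`, an iPhone Xs on iOS 17.4 is chosen ahead of an iPhone 16 Pro on iOS 18.5, because step 1 only accepts preferred models.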

Replace direct checkout of the PR branch with logic that fetches the PR, computes the merge-base against the current branch, and cherry-picks commits from the merge-base..PR_HEAD onto the current branch (using --no-commit). Adds commit counting, a warning when no commits are found, conflict handling with status and diff output, and extra logging (current branch, merge base, commit count, last commit and status). Also updates the pipeline step display name to 'Cherry-pick PR Changes' and tweaks the fetch message.

Add a CI step to run ./build.ps1 --target=dotnet-buildtasks (Release, diagnostic) to compile MSBuild tasks required for MAUI builds. The step includes a retry on failure and sets DOTNET_TOKEN and PRIVATE_BUILD environment variables for accessing internal artifacts. Placed before the simulator/emulator listing to ensure tasks are available for subsequent MAUI jobs.

Introduce structured PRAgent phase output directories and content.md artifacts, and make agent workflows non-interactive/CI-first. Documentation (.github/agents/*, SKILLs and plan templates) updated to require writing phase outputs to CustomAgentLogsTmp/PRState/{PRNumber}/PRAgent/{phase}/content.md, to prefer continuing autonomously on environment blockers (retry once, then skip) and to remove strict requirements to create a single monolithic state file before starting. Review-PR.ps1 now creates PRAgent phase directories, documents CI-mode behavior and phase output paths, adjusts pr-finalize output locations, and updates labeling invocation. Several ai-summary-comment docs/scripts were updated to read generic "content" artifacts (and to remove SkipValidation usage). Overall changes align agent scripts and docs with CI-friendly, structured phase outputs and clearer failure/retry semantics.

Remove stale agent session notes and tighten PR scripting behavior.

Changes:
- Deleted .github/agent-pr-session/*.md (removed archived agent session files).
- .github/scripts/Review-PR.ps1: updated PR log directory and display path to use "PRAgent/copilot-logs" subfolder.
- .github/skills/verify-tests-fail-without-fix/scripts/verify-tests-fail.ps1: stop falling back to an "unknown" PR folder; now errors and exits if -PRNumber is not provided.

Reason: avoid ambiguous "unknown" PR artifacts and standardize log location under PRAgent; fail early when PR number is missing to prevent accidental runs with incorrect paths.

Align docs and script to the new path layout under CustomAgentLogsTmp/PRState/{number}/PRAgent. Updated PLAN-TEMPLATE.md (post-pr-finalize SummaryFile path), SKILL.md (auto-loading description), and post-try-fix-comment.ps1 (examples, parameter docs, and path regex) so try-fix and finalize operations look in the PRAgent/try-fix and PRAgent/pr-finalize locations.

Replace CI-specific language with general "autonomous/non-interactive" phrasing and tighten guidance to not prompt a human operator. Updates remove or reword references to "CI mode" and emphasize skipping blocked phases, retrying once, and continuing autonomously. Affected files: .github/agents/pr.md, .github/agents/pr/PLAN-TEMPLATE.md, .github/agents/pr/SHARED-RULES.md, .github/agents/pr/post-gate.md, and .github/scripts/Review-PR.ps1.

Remove the optional -Content parameter and make post-ai-summary-comment.ps1 always load phase content from CustomAgentLogsTmp/PRState/<PRNumber>/PRAgent/*/content.md. Update script help, examples, and validation messages; refactor auto-load logic to locate the repo root, load available phase files (pre-flight, gate, try-fix, report), build a status table and per-phase details, and synthesize the final comment. Also update SKILL.md to remove the "Provide content directly" section, adjust the Parameters table, and clarify the auto-loading behavior in the documentation.

Save the current branch and commit SHA before running the PR agent and use that pinned restore point to reliably restore the working tree between phases. Detects if the agent or finalize step changed branch/HEAD and recovers via git checkout/reset to the saved branch+SHA; otherwise performs targeted checkouts from the pinned SHA. Also update targeted file recoveries to use the pinned SHA. Additionally, clarify the try-fix skill docs: the baseline script requires the PR changes to be present on the current branch and should be reported as Blocked rather than switching branches when fix files are missing.

Introduce centralized agent label management and documentation. Adds a new shared script (.github/scripts/shared/Update-AgentLabels.ps1) that parses phase content.md files and idempotently creates/applies outcome, signal, and tracking labels (s/agent-*) via the GH API. Integrates label application into Review-PR.ps1 as Phase 4 (with a recovery attempt if the helper is missing). Adds comprehensive docs (.github/docs/agent-labels.md) and documents labeling behavior in .github/agents/pr/SHARED-RULES.md. Removes the older, in-file verification label logic from verify-tests-fail.ps1 and its calls, consolidating label responsibilities into the new helper. Labels are applied non-fatally and auto-created/updated on first use.

Update Update-AgentLabels.ps1

Rename s/agent-fix-lose label to s/agent-fix-pr-picked

Co-Authored-By: Copilot

Introduce a -Unified mode to post-pr-finalize-comment.ps1 and call it from Review-PR.ps1. When enabled, the script injects or updates a PR Finalization section inside the existing AI Summary comment (or creates a new unified AI Summary comment) using explicit markers and a collapsible details block; it also removes any legacy standalone finalize comment. Dry-run preview support was added (writes preview file), and existing standalone behavior remains the default when -Unified is not passed. Changes made in .github/scripts/Review-PR.ps1 and .github/skills/ai-summary-comment/scripts/post-pr-finalize-comment.ps1.
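The inject-or-update behaviour can be sketched as follows. This is an illustrative Python sketch. The marker strings and the function name are hypothetical, since the real markers used by post-pr-finalize-comment.ps1 are not shown in this text.

```python
import re

# Hypothetical marker strings delimiting the finalize section.
START = "<!-- pr-finalize:start -->"
END = "<!-- pr-finalize:end -->"


def upsert_section(comment_body, section_md):
    """Insert or replace the marked, collapsible section in a comment body."""
    block = (
        f"{START}\n<details>\n<summary>PR Finalization</summary>\n\n"
        f"{section_md}\n\n</details>\n{END}"
    )
    pattern = re.compile(re.escape(START) + r".*?" + re.escape(END), re.S)
    if pattern.search(comment_body):
        # A function replacement avoids backslash handling in `block`.
        return pattern.sub(lambda _: block, comment_body, count=1)
    return comment_body.rstrip("\n") + "\n\n" + block + "\n"
```

Running the function twice replaces the earlier section instead of appending a second one, which is what makes repeated runs safe.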

Update Start-Emulator.ps1 to select iOS simulators that match UI test baseline devices. Replace the single preferred device list with a per-iOS-version mapping (iOS-18/iOS-17 prefer iPhone Xs; iOS-26 prefers iPhone 11 Pro) and adjust the selection logic to use the version-specific preferences. Comments were updated to document why certain devices are preferred to ensure consistency with UITest.cs baselines.

When deploying or starting iOS simulators, add logic to detect any other booted simulators and shut them down to prevent Appium from connecting to the wrong device. Implements parsing of `xcrun simctl list devices --json` and shuts down any booted simulator whose UDID does not match the target in both Build-AndDeploy.ps1 and Start-Emulator.ps1. Also update the success message to include the simulator name for clearer logs.
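A minimal Python sketch of the detection step, assuming the usual shape of `xcrun simctl list devices --json` output (a `devices` object keyed by runtime identifier, each holding a list of devices with `udid` and `state` fields). The function name is hypothetical and the scripts themselves are PowerShell.

```python
import json


def other_booted_udids(simctl_json, target_udid):
    """Return the UDIDs of booted simulators other than the target."""
    data = json.loads(simctl_json)
    booted = []
    for devices in data.get("devices", {}).values():  # one list per runtime
        for device in devices:
            if device.get("state") == "Booted" and device.get("udid") != target_udid:
                booted.append(device["udid"])
    return booted
```

Each returned UDID would then be passed to `xcrun simctl shutdown <udid>` before the target simulator is used.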

@3x

…ype unavailable

iPhone Xs device type (com.apple.CoreSimulator.SimDeviceType.iPhone-Xs) is not available on newer Xcode versions on CI agents. iPhone 11 Pro has the same screen resolution (1125x2436 @3x) so snapshots match the baselines captured on iPhone Xs.

Fallback order: iPhone Xs (existing) → iPhone 11 Pro (existing) → create iPhone Xs → create iPhone 11 Pro → first available iPhone.

Fix: Start-Emulator.ps1 respects DEVICE_UDID env var and prefers iPhone 11 Pro

Two fixes:
1. Check $env:DEVICE_UDID before auto-detecting - the CI pipeline sets this via ##vso[task.setvariable] but Start-Emulator.ps1 was ignoring it
2. Add iPhone 11 Pro as second preferred device for iOS 18/17 (same 1125x2436 resolution as iPhone Xs) - iPhone Xs device type is unavailable on CI agents

Fix CI iOS simulator selection to use iPhone Xs for snapshot baselines

The CI pipeline was selecting iPhone 16 Pro (1206x2472) which doesn't match the UI test baseline screenshots captured on iPhone Xs (1124x2286).

Changes:
- Create iPhone Xs simulator if not available on CI agent
- Target the latest stable iOS runtime (18.x preferred)
- Shutdown other booted simulators to prevent Appium conflicts

Co-Authored-By: Copilot

dotnet#34156)

Pipeline runs for the Copilot CI pipeline had no meaningful title, making it hard to identify runs at a glance. This adds a step immediately after `Validate Parameters` that renames the run to `PR: {PRNumber} {Platform}` using the Azure DevOps logging command.

## Change

- **`eng/pipelines/ci-copilot.yml`**: Adds a `Set Pipeline Run Title` step after `Validate Parameters`:

```yaml
- script: |
    echo "##vso[build.updatebuildnumber]PR: ${{ parameters.PRNumber }} ${{ parameters.Platform }}"
  displayName: 'Set Pipeline Run Title'
```

Produces titles like `PR: 1234 android` or `PR: 5678 ios`. Implemented as a bash `script:` for compatibility with the macOS agents used by this pipeline.

<details>
<summary>Original prompt</summary>

> Create a pull request in `dotnet/maui` (base branch `copilot-ci`) to update the Azure DevOps pipeline at `eng/pipelines/ci-copilot.yml` so that the pipeline run title/build number is updated early in the run.
>
> Requirements:
> - Add a step shortly after the existing **Validate Parameters** step to rename the pipeline run using Azure DevOps logging command `##vso[build.updatebuildnumber]...`.
> - The run title should be exactly: `PR: {PR number} {Platform}` where:
>   - PR number comes from parameter `${{ parameters.PRNumber }}`
>   - Platform comes from parameter `${{ parameters.Platform }}`
> - Use a clear `displayName`, e.g. `Set Pipeline Run Title`.
> - Keep the change minimal and do not alter existing behavior beyond setting the run title.
>
> Context:
> - File source URL: https://github.com/dotnet/maui/blob/copilot-ci/eng/pipelines/ci-copilot.yml
> - CommitOID (context): 4896e12
>
> Notes:
> - Implement as a YAML step using `script:` (bash) for maximum compatibility on macOS agents.
> - Ensure the title format does not include parentheses—use a single space between PR number and platform, e.g. `PR: 1234 android`.

</details>

*This pull request was created from Copilot chat.*

Co-authored-by: copilot-swe-agent[bot]
Co-authored-by: jfversluis

Second test

This reverts commit 28bc3cc.

Introduce automated agent labeling for PR reviews: add a new shared labeler script (.github/scripts/shared/Update-AgentLabels.ps1) and wire it into Review-PR.ps1 as Phase 4 (Apply Labels). The labeler parses phase content.md files (gate/try-fix/report) to determine outcome, gate and fix signal labels, ensures labels exist, and applies/removes mutually-exclusive outcome/signal labels plus a tracking label (s/agent-reviewed). Add comprehensive docs (.github/docs/agent-labels.md) and update the PR agent SHARED-RULES.md to describe label meanings and expectations. Operations are idempotent and non-fatal; Review-PR.ps1 attempts a targeted recovery if the helper is missing.

Copilot

Pull request overview

This PR implements a comprehensive GitHub label-based metrics system for tracking AI agent PR review workflow outcomes. The system uses s/agent-* prefixed labels to track review outcomes, test verification results, and fix comparison results across the automated PR review pipeline.

Changes:

Introduces new label management module (Update-AgentLabels.ps1) with idempotent label operations
Adds Phase 4 to Review-PR.ps1 for automatic label application based on phase outcomes
Refactors agent output from centralized state files to distributed content.md files per phase
Updates all agent instructions and skill documentation to reflect the new phase output artifact structure
Removes old label management code (Update-VerificationLabels) in favor of centralized system
Adds new CI/Copilot pipeline configuration for automated agent PR reviews
Cleans up Azure DevOps variable groups and pipeline configuration

Reviewed changes

Copilot reviewed 26 out of 26 changed files in this pull request and generated 6 comments.

File	Description
`.github/scripts/shared/Update-AgentLabels.ps1`	New module implementing label management with parsing, application, and self-bootstrapping
`.github/scripts/Review-PR.ps1`	Adds Phase 4 for label application; implements pinned SHA restoration; adds phase output directories
`.github/docs/agent-labels.md`	Comprehensive documentation of the label system, architecture, and usage examples
`eng/pipelines/ci-copilot.yml`	New Azure DevOps pipeline for running Copilot PR reviewer agent with full environment setup
`eng/pipelines/common/variables.yml`	Simplifies variable group structure; removes unused conditional logic
`eng/pipelines/common/provision.yml`	Adds `skipCertificates` parameter for CI scenarios
`.github/skills/verify-tests-fail-without-fix/scripts/verify-tests-fail.ps1`	Removes old label management; updates output path to new structure
`.github/skills/try-fix/SKILL.md`	Updates documentation to remove state file references
`.github/skills/learn-from-pr/SKILL.md`	Removes session markdown references
`.github/skills/ai-summary-comment/scripts/*.ps1`	Updates all scripts to auto-load from PRAgent phase content.md files instead of state files
`.github/skills/ai-summary-comment/SKILL.md`	Documents new auto-loading behavior from phase files
`.github/skills/ai-summary-comment/NO-EXTERNAL-REFERENCES-RULE.md`	Simplifies by removing state file references
`.github/skills/ai-summary-comment/IMPROVEMENTS.md`	Updates terminology from "state file" to "content"
`.github/scripts/shared/Start-Emulator.ps1`	Improves iOS simulator selection logic for UI test baseline compatibility
`.github/scripts/shared/Build-AndDeploy.ps1`	Adds logic to shutdown other booted simulators before deployment
`.github/scripts/BuildAndRunHostApp.ps1`	Adds test artifact collection for screenshots and page source
`.github/copilot-instructions.md`	Updates agent documentation to reflect new output structure
`.github/agents/pr/post-gate.md`	Updates for autonomous execution mode and phase output artifacts
`.github/agents/pr/SHARED-RULES.md`	Major update: documents phase output artifacts and agent label system; changes blocking behavior to autonomous
`.github/agents/pr/PLAN-TEMPLATE.md`	Updates plan template to reflect new phase output requirements
`.github/agents/pr.md`	Removes state file creation steps; updates for phase output artifacts
`.github/agents/learn-from-pr.md`	Removes session markdown references

Copilot · 2026-02-26T00:30:09Z

.github/docs/agent-labels.md

+| `s/agent-fix-win` | 🟢 `#66BB6A` | AI found a better alternative fix than the PR | Fix phase: alternative selected over PR's fix |
+| `s/agent-fix-pr-picked` | 🟠 `#FF7043` | AI could not beat the PR fix — PR is the best among all candidates | Fix phase: PR selected as best after comparison |
+
+Gate labels (`gate-passed`/`gate-failed`) are mutually exclusive with each other. Fix labels (`fix-win`/`fix-lose`) are mutually exclusive with each other.


The label name s/agent-fix-pr-picked in the code does not match the documentation which refers to it as s/agent-fix-lose.

In the documentation at line 34, the table mentions "Fix labels (fix-win/fix-lose)" suggesting the label should be called s/agent-fix-lose, but the actual label defined in Update-AgentLabels.ps1 line 35 is s/agent-fix-pr-picked.

Either the code should use s/agent-fix-lose to match the documentation's naming pattern, or the documentation should be updated to consistently use s/agent-fix-pr-picked. The current mismatch could cause confusion when users try to query these labels.

Suggested change

Gate labels (`gate-passed`/`gate-failed`) are mutually exclusive with each other. Fix labels (`fix-win`/`fix-lose`) are mutually exclusive with each other.

Gate labels (`gate-passed`/`gate-failed`) are mutually exclusive with each other. Fix labels (`fix-win`/`fix-pr-picked`) are mutually exclusive with each other.

Copilot · 2026-02-26T00:30:09Z

.github/docs/agent-labels.md

+# PRs the agent approved
+is:pr label:s/agent-approved
+
+# PRs where agent found a better fix


The comment at line 107 says "PRs where agent found a better fix" but queries for s/agent-fix-pr-picked. This is semantically backwards.

According to the label definitions:

s/agent-fix-win = "AI found a better alternative fix than the PR"

s/agent-fix-pr-picked = "AI could not beat the PR fix — PR is the best"

So the query comment should say "PRs where agent could NOT beat the PR fix" or the query should use label:s/agent-fix-win instead.

Suggested change

# PRs where agent found a better fix

# PRs where agent could NOT beat the PR fix (PR fix was best)

Copilot · 2026-02-26T00:30:09Z

eng/pipelines/common/variables.yml

    - group: SDL_Settings
+    - group: AzureDevOps-Artifact-Feeds-Pats
+    - ${{ if eq(variables['Build.DefinitionName'], 'dotnet-maui') }}:
+      - group: Publish-Build-Assets # This variable group contains secrets to publis to BAR


Typo in the comment: "publis" should be "publish". The comment reads "This variable group contains secrets to publis to BAR" but should read "This variable group contains secrets to publish to BAR".

Suggested change

- group: Publish-Build-Assets # This variable group contains secrets to publis to BAR

- group: Publish-Build-Assets # This variable group contains secrets to publish to BAR

Copilot · 2026-02-26T00:30:10Z

.github/scripts/shared/Update-AgentLabels.ps1

+$script:ManualLabels = @{
+    's/agent-fix-implemented' = @{ Description = 'PR author implemented the agent suggested fix'; Color = '7B1FA2' }
+}


The PR description mentions TWO manual labels (s/agent-fix-implemented and s/agent-suggestions-implemented), but the code only defines ONE manual label (s/agent-fix-implemented).

The PR description states:

| `s/agent-fix-implemented` | 🟣 `#7B1FA2` | PR author implemented the agent's suggested fix | Maintainer applies when PR author adopts agent's recommendation | | `s/agent-suggestions-implemented` | 🟣 `#7B1FA2` | PR author implemented the agent's code suggestions | Maintainer applies when PR author adopts agent's recommendation |

However, Update-AgentLabels.ps1 only defines s/agent-fix-implemented (line 39), and the documentation only documents s/agent-fix-implemented (line 50). Either add the second manual label to the code, or remove it from the PR description.

Copilot · 2026-02-26T00:30:10Z

.github/scripts/shared/Update-AgentLabels.ps1

+            if ($reportContent -match '(?i)Final\s+Recommendation:\s*APPROVE|✅\s*Final\s+Recommendation:\s*APPROVE') {
+                $result.Outcome = 'approved'
+            }
+            elseif ($reportContent -match '(?i)Final\s+Recommendation:\s*REQUEST.CHANGES|⚠️\s*Final\s+Recommendation:\s*REQUEST.CHANGES') {


The regex pattern on line 387 uses REQUEST.CHANGES with an unescaped dot (.), but the pattern likely intends to match either "REQUEST CHANGES" or "REQUEST_CHANGES".

In regex, . matches any character, so this would also match "REQUESTXCHANGES" or "REQUEST-CHANGES" etc. If the intent is to match a space or underscore, the pattern should be REQUEST[\s_]CHANGES. If the intent is only to match with a space (which seems more likely based on line 384's APPROVE pattern), then it should be REQUEST\s+CHANGES.

Suggested change

elseif ($reportContent -match '(?i)Final\s+Recommendation:\s*REQUEST.CHANGES|⚠️\s*Final\s+Recommendation:\s*REQUEST.CHANGES') {

elseif ($reportContent -match '(?i)Final\s+Recommendation:\s*REQUEST\s+CHANGES|⚠️\s*Final\s+Recommendation:\s*REQUEST\s+CHANGES') {
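The difference between the two patterns can be checked directly. The sketch below uses Python's `re` module for illustration only; the script itself runs on .NET regular expressions, where `.`, `\s` and `(?i)` behave the same way for these inputs.

```python
import re

# Original pattern: the unescaped dot accepts any single character.
loose = re.compile(r"(?i)Final\s+Recommendation:\s*REQUEST.CHANGES")
# Suggested pattern: one or more whitespace characters only.
strict = re.compile(r"(?i)Final\s+Recommendation:\s*REQUEST\s+CHANGES")

samples = [
    "Final Recommendation: REQUEST CHANGES",
    "Final Recommendation: REQUESTXCHANGES",
    "Final Recommendation: REQUEST_CHANGES",
]
for text in samples:
    # Show which pattern accepts each sample.
    print(f"{text!r}: loose={bool(loose.search(text))} strict={bool(strict.search(text))}")
```

Both patterns accept the first sample, while only the loose one accepts the other two. If "REQUEST_CHANGES" also needs to match, the `REQUEST[\s_]CHANGES` form mentioned in the comment is the alternative.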

Copilot · 2026-02-26T00:30:10Z

.github/scripts/shared/Update-AgentLabels.ps1

+        Gate phase result: 'passed', 'failed', or $null (skipped)
+
+    .PARAMETER FixResult
+        Fix phase result: 'win' (PR best), 'lose' (alternative better), or $null (skipped)


The parameter documentation comment on line 214 is backwards. It says:

'win' (PR best), 'lose' (alternative better)

But based on the actual logic in the function (lines 259-286) and the label descriptions:

'win' → applies s/agent-fix-win → "AI found a better alternative fix than the PR" (agent wins, not PR)

'lose' → applies s/agent-fix-pr-picked → "AI could not beat the PR fix — PR is the best" (agent loses, not alternative better)

The comment should say: 'win' (agent found better alternative), 'lose' (PR is best) which is already correctly stated on line 219.

Suggested change

Fix phase result: 'win' (PR best), 'lose' (alternative better), or $null (skipped)

Fix phase result: 'win' (agent found better alternative), 'lose' (PR is best), or $null (skipped)

kubaflo changed the title ~~Add agent workflow metrics labels (s/agent-* prefix)~~ Agent Workflow Metrics via GitHub Labels Feb 10, 2026

rmarinho and others added 29 commits February 11, 2026 20:56

[ci] Add copilot CI

a97fd0d

Install GH

539bdcf

Try again

62109e9

skip codeql

e0e367a

Pipe the output

55c0331

Cleanup file

3a81e85

fix prompt

69326b2

Fixx

ca59d6d

Fix it again

e8fc0a3

add provisioning

f7b5a58

again

6339ba8

try update

25f4df2

skip certs

747523e

skip certs

4370f80

comment

23486ca

try again

d4f77f0

Update script

7acc340

Try again

26447f2

Try list first

bc5e3f1

even before

cc81674

again

98223da

Again

4b50a0b

Updated the prompt

de8f70d

Remove agent sessions

4a4f213

kubaflo added 7 commits February 13, 2026 13:51

Revert labels logic

66788a1

kubaflo force-pushed the agent-labels branch from 677d745 to fc148e2 Compare February 19, 2026 20:15

kubaflo and others added 15 commits February 20, 2026 01:29

[ci] Fix variables

cc37d34

Second test

Merge branch 'copilot-ci-rui' into copilot-ci

31a95a7

[ci] Publish-Build-Assets just for pack/release build

1dfaf81

Merge branch 'copilot-ci-rui' into copilot-ci

11c40ba

Update ci-copilot.yml

66a705a

Revert "Add agent label automation and docs"

a447b11

This reverts commit 28bc3cc.

Merge branch 'main' into copilot-ci

9c8824a

Update provision.yml

515a631

kubaflo force-pushed the agent-labels branch from de1a7e8 to a530d85 Compare February 26, 2026 00:24

kubaflo marked this pull request as ready for review February 26, 2026 00:24

Copilot AI review requested due to automatic review settings February 26, 2026 00:24

Copilot started reviewing on behalf of kubaflo February 26, 2026 00:25

kubaflo added the area-ai-agents (Copilot CLI agents, agent skills, AI-assisted development) and copilot labels Feb 26, 2026

Copilot AI reviewed Feb 26, 2026

Agent Workflow Metrics via GitHub Labels #33986

kubaflo wants to merge 130 commits into dotnet:main from kubaflo:agent-labels

5 participants
