Codestin Search App

tmchow · 2026-03-26T05:22:54Z

The agent-native-reviewer had the right philosophy but couldn't navigate real codebases -- it told the agent WHAT to check without helping it figure out HOW to find things in an unfamiliar project. This rewrites the system prompt to close that gap.

Key changes:

Triage step (step 0) -- before any review work, the agent determines whether agent integration exists, identifies the tech stack, and decides whether this is an incremental review or full audit. Previously it dove straight into "Map the Landscape" with no orientation.
Stack-specific search strategies -- a table mapping 6 common stacks (Vercel AI SDK, LangChain, OpenAI Assistants, Claude Code plugins, Rails + MCP, generic) to concrete file patterns for finding UI actions and agent tools.
Prioritization heuristics -- three tiers (must-have / should-have / low-priority parity) so core domain CRUD gets flagged as Critical while missing parity on a settings page is an Observation at most.
"What You Don't Flag" section -- prevents false positives on intentionally human-only flows (CAPTCHA, 2FA, OAuth consent, biometrics, platform gates).
Noun Test (step 6) -- restored from the original's "Write to Location" heuristic. For every domain entity, checks context injection + action parity + discoverability in one pass.
Confidence calibration -- aligns with the ce-review ensemble pattern used by peer agents like correctness-reviewer.
Nuanced "primitives over workflows" -- workflow tools flagged for review, not categorically rejected. Safety-critical atomic sequences and external system orchestration are acknowledged exceptions.

Also adds missing frontmatter fields (color: cyan, tools: Read, Grep, Glob, Bash) and collapses the original's 7 verbose anti-pattern subsections into a scannable reference table. Net result: 192 lines down from 262, with substantially more actionable guidance.

🤖 Generated with Claude Opus 4.6 (1M context, extended thinking) via Claude Code

…tack-aware search - Add missing frontmatter fields (color, tools) to match peer review agents - Add triage step (step 0) to orient the agent in unfamiliar codebases - Add stack-specific search strategies for common frameworks (Vercel AI SDK, LangChain, OpenAI Assistants, Claude Code plugins, Rails + MCP) - Add prioritization heuristics so not every missing tool is a Critical finding - Add incremental vs. full audit guidance for the most common review scenario - Add confidence calibration to align with the ce-review ensemble pattern - Add "What You Don't Flag" section to prevent false positives on intentionally human-only flows (CAPTCHA, 2FA, OAuth consent, biometrics) - Restore the "Noun Test" heuristic as step 6 in the review process - Nuance "primitives over workflows" with justified exceptions - Collapse verbose anti-pattern subsections into a scannable reference table - Cut from 262 lines to 192 while adding substantial new guidance

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: 2a42f5111a

ℹ️ About Codex in GitHub

Your team has set up Codex to review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

Codex can also answer questions or update the PR. Try commenting "@codex address that feedback".

- Align noun-test severity with priority tiers from step 2

chatgpt-codex-connector Bot reviewed Mar 26, 2026

View reviewed changes

Comment thread plugins/compound-engineering/agents/review/agent-native-reviewer.md Outdated

Address PR review feedback (#387)

4c01b73

- Align noun-test severity with priority tiers from step 2

tmchow merged commit e792166 into main Mar 26, 2026
2 checks passed

This was referenced Mar 26, 2026

chore: release main #397

Merged

chore: release main #427

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix: improve agent-native-reviewer with triage, prioritization, and stack-aware search#387

fix: improve agent-native-reviewer with triage, prioritization, and stack-aware search#387
tmchow merged 2 commits into
mainfrom
improve-agent-native-reviewer

tmchow commented Mar 26, 2026 •

edited

Loading

Uh oh!

chatgpt-codex-connector Bot left a comment

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

tmchow commented Mar 26, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

chatgpt-codex-connector Bot left a comment

Choose a reason for hiding this comment

💡 Codex Review

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

tmchow commented Mar 26, 2026 •

edited

Loading