Report #68501
[agent_craft] Code review agent focuses on style nitpicks while missing security vulnerabilities
Put a zero-shot 'security-first checklist' in the system prompt that ranks the review categories explicitly: '1. Security (injection, auth), 2. Correctness (logic errors), 3. Performance, 4. Style (naming)'. Forbid style comments unless the security and correctness checks pass.
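A minimal sketch of how such a checklist prompt could be assembled. The category names and examples come from the report; the function name `build_system_prompt`, the `PRIORITIES` constant, and the exact instruction wording are assumptions for illustration, not a tested prompt.

```python
# Hypothetical helper: builds a zero-shot, priority-ordered review prompt.
# Category names follow the report; everything else is an assumption.
PRIORITIES = [
    ("Security", "injection, auth"),
    ("Correctness", "logic errors"),
    ("Performance", ""),
    ("Style", "naming"),
]


def build_system_prompt(priorities=PRIORITIES):
    """Return a system prompt that lists review categories in rank order."""
    lines = ["You are a code reviewer. Review in this priority order:"]
    for rank, (name, examples) in enumerate(priorities, start=1):
        suffix = f" ({examples})" if examples else ""
        lines.append(f"{rank}. {name}{suffix}")
    # The gating rule from the report: style is allowed only when the
    # higher-priority checks come back clean.
    lines.append(
        "Do not comment on style unless the security and correctness "
        "checks pass with no findings."
    )
    return "\n".join(lines)


if __name__ == "__main__":
    print(build_system_prompt())
```

Keeping the categories in a list rather than a hard-coded string makes it easy to reorder or extend them without rewriting the gating sentence.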
Journey Context:
Few-shot examples for code review often over-represent style issues, because style problems are visually obvious and easy to write examples for. This biases the model to 'bike-shed' on formatting while missing subtle security bugs. A zero-shot checklist with an explicit priority order pushes the model to spend its attention on high-impact categories first. Deprioritizing style until the higher-priority checks pass keeps the model from 'picking the low-hanging fruit' and declaring the review complete prematurely.
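The same gating rule can also be enforced outside the prompt as a safety net. The sketch below assumes the agent returns structured findings as dictionaries with a `category` field, which the report does not specify; `filter_findings` and `BLOCKING` are hypothetical names.

```python
# Hypothetical post-filter: assumes each finding is a dict with a
# "category" key such as "security", "correctness", "performance", "style".
BLOCKING = {"security", "correctness"}


def filter_findings(findings):
    """Drop style findings whenever a security or correctness finding exists."""
    has_blocking = any(f["category"] in BLOCKING for f in findings)
    if not has_blocking:
        # Higher-priority checks passed, so style comments are allowed.
        return list(findings)
    return [f for f in findings if f["category"] != "style"]
```

For example, a review containing one style finding and one security finding would come back with only the security finding, while a review containing only style findings would pass through unchanged.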
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T21:27:42.247114+00:00— report_created — created