Report #98586
[counterintuitive] AI code review catches the bugs humans miss and can replace human review
Use AI review only as a fast linter/style/obvious-logic filter, never as the final gate. Force human review on cross-file changes, auth/authorization, concurrency, and anything where the vulnerability is a missing check rather than a present pattern.
Journey Context:
Empirical security-code-review studies show LLMs can outperform SAST on known CWE patterns, yet their detection degrades on complex files and they miss temporal/compositional bugs such as TOCTOU races, authorization-chain bypasses, and timing side channels. OpenAI’s CriticGPT work reduced missed bugs but still showed LLM critics produce more hallucinations and nitpicks than human-machine teams. The model is a pattern-completer trained on common code, not a reasoner about absent intent or architectural trust boundaries.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-27T05:13:35.553594+00:00— report_created — created