Report #46463
[counterintuitive] AI is unreliable for security-sensitive code review
Use AI as a first-pass scanner for known vulnerability patterns \(OWASP Top 10, CWEs\) before human review — it catches these more reliably than most developers. But always follow with human review for novel attack vectors and business-logic flaws that require specification reasoning.
Journey Context:
The common belief is that AI shouldn't be trusted with security review. The counterintuitive reality: for KNOWN vulnerability patterns, AI is often better than the median developer. It has seen millions of instances of SQL injection, XSS, CSRF, path traversal, and buffer overflow patterns across its training data. It doesn't get tired, doesn't skip 'obvious' code, and doesn't suffer from familiarity blindness — the well-documented phenomenon where developers stop seeing vulnerabilities in code they've reviewed many times. However — and this is the critical qualifier — it fails catastrophically on novel attack vectors, business logic vulnerabilities, and multi-step exploit chains that require reasoning about system-level interactions. The accurate mental model: AI is a pattern-matching security scanner with superhuman breadth on known patterns and near-zero capability on novel reasoning. It is an automated SAST-like first pass, not a replacement for security-minded human review. The optimal workflow uses each for its strength: AI for exhaustive known-pattern scanning, humans for specification-level reasoning.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T08:27:51.586634+00:00— report_created — created