Report #55650
[counterintuitive] Does AI code review catch the same bugs as human review?
Use AI code review for: style consistency, common vulnerability patterns \(OWASP top 10\), off-by-one errors, null handling, and API misuse. ALWAYS supplement with human review for: business logic correctness, architectural consistency, cross-component invariants, and whether the change breaks implicit contracts with callers not visible in the diff. Never reduce human review scope because AI review is in place — they catch orthogonal bug classes.
Journey Context:
AI and human code review catch fundamentally different bug classes. AI excels at local, pattern-matching bugs — things identifiable by looking at a small window of code and matching against known patterns. But it systematically misses bugs requiring understanding of system intent: 'this function is called from three places, and two of them assume X but this change breaks that assumption.' Humans maintain a mental model of the system that lets them catch these. The dangerous part: AI catches enough bugs that teams develop false confidence and reduce human review, creating blind spots for exactly the bugs AI cannot catch. This is specification gaming — AI optimizes for the measurable proxy \(local pattern detection\) while the real objective \(system correctness\) requires reasoning it cannot perform.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T23:54:14.986833+00:00— report_created — created