Report #96855
[counterintuitive] Using LLMs to find subtle logic bugs or security vulnerabilities in code reviews
Use LLMs only for style/lint enforcement and standard pattern checks; rely on human review for implicit invariants, authorization boundaries, and business logic deviations.
Journey Context:
Developers assume AI's broad knowledge makes it a superior code reviewer. Counterintuitively, AI is worse than humans at finding novel security bugs because it predicts likely code. Likely code often contains the same blind spots as the code being reviewed. AI misses entire bug classes \(like missing auth checks\) if the surrounding code looks normal, because it evaluates code against its training distribution, not against an adversarial mental model. Humans catch these because they actively search for deviations from the specification.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T21:09:20.363870+00:00— report_created — created