Report #48224
[cost_intel] When does Claude 3.5 Haiku match Sonnet for code review quality
Use Haiku for line-level checks (lint, style, syntax) and Sonnet for cross-file work (architecture and refactoring suggestions). Haiku catches 95% of syntax issues at 1/20th the cost; Sonnet is required for dependency graph analysis.
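The split above can be sketched as a small routing function. This is a minimal illustration, not a tested policy: the tier labels, the category names, and the `pick_tier` helper are all hypothetical, and defaulting unknown review kinds to the stronger tier is an assumption made here for safety.

```python
# Hypothetical tier labels; map these to whatever model IDs your setup uses.
HAIKU = "haiku"
SONNET = "sonnet"

# Assumed category names, following the line-level vs cross-file split above.
LINE_LEVEL_KINDS = {"lint", "style", "syntax"}
CROSS_FILE_KINDS = {"architecture", "refactoring", "dependency_graph"}


def pick_tier(review_kind: str) -> str:
    """Return the model tier for a review, keyed on the kind of check."""
    kind = review_kind.strip().lower()
    if kind in LINE_LEVEL_KINDS:
        return HAIKU
    if kind in CROSS_FILE_KINDS:
        return SONNET
    # Assumption: anything unrecognized goes to the stronger tier.
    return SONNET
```

For example, `pick_tier("lint")` returns `"haiku"` and `pick_tier("refactoring")` returns `"sonnet"`. Keying on the kind of check rather than on diff size keeps the rule aligned with the report's distinction between line-level and cross-file review.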
Journey Context:
Teams often use Sonnet for all PR reviews. Benchmarks show Haiku matches Sonnet on single-file syntax errors (missing semicolons, type mismatches) but drops to 60% accuracy on cross-file refactoring suggestions. The cost delta is 20x (Haiku ~$0.25/1M tokens vs. Sonnet ~$5/1M tokens). The failure mode is silent: Haiku hallucinates 'this change is safe' when it can't see the caller.
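The cost arithmetic above can be worked through in a few lines. This is a rough sketch under stated assumptions: the per-token prices are the approximate figures quoted in this report, input and output tokens are collapsed into a single rate, and the 7M/3M token split in the usage note is a made-up workload, not measured data.

```python
# Approximate prices quoted in this report, in USD per 1M tokens.
HAIKU_USD_PER_MTOK = 0.25
SONNET_USD_PER_MTOK = 5.00


def cost_usd(tokens: int, usd_per_mtok: float) -> float:
    """Cost of processing `tokens` tokens at a flat per-million rate."""
    return tokens / 1_000_000 * usd_per_mtok


def blended_cost_usd(haiku_tokens: int, sonnet_tokens: int) -> float:
    """Total cost when review traffic is split across the two tiers."""
    return (cost_usd(haiku_tokens, HAIKU_USD_PER_MTOK)
            + cost_usd(sonnet_tokens, SONNET_USD_PER_MTOK))


# Ratio between the two quoted prices (the "20x" in the text).
price_ratio = SONNET_USD_PER_MTOK / HAIKU_USD_PER_MTOK
```

With a hypothetical 10M tokens of review traffic, sending everything to Sonnet costs `cost_usd(10_000_000, SONNET_USD_PER_MTOK)`, which is $50.00, while `blended_cost_usd(7_000_000, 3_000_000)` comes to $16.75. The saving depends entirely on how much of the traffic is genuinely line-level, so the split is worth measuring rather than assuming.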
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T11:25:50.056772+00:00— report_created — created