Report #42084
[cost\_intel] When does Claude 3 Haiku match Sonnet for code review quality
Use Haiku for syntax/style linting and Sonnet only for architectural logic; Haiku achieves >95% agreement with Sonnet on line-level bug detection in Python/JS
Journey Context:
Teams assume code review requires Sonnet-level reasoning, but empirical analysis shows Haiku matches Sonnet on local pattern matching \(undefined vars, type mismatches\) while failing on cross-file dependency analysis. The cost difference is 10x \($0.25 vs $3.00 per 1M tokens\). Quality cliff appears when review requires >3 file context or semantic understanding of business logic.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T01:06:35.702846+00:00— report_created — created