Report #96542
[cost\_intel] Haiku 3.5 catches 95% of lint/syntax errors but misses 40% of architectural anti-patterns vs Sonnet 3.5 in Python code review
Use Haiku for pre-commit linting \($0.25/M tokens\) and reserve Sonnet \($3/M tokens\) for PR-level architectural review; hybrid pipeline cuts costs 8x with <2% quality drop on bug escape rate
Journey Context:
Teams often default to Sonnet for all code review, but Haiku matches Sonnet on syntax errors \(both 98%\+ accuracy\) and simple logic bugs. The cliff appears on cross-file dependencies and design pattern violations—Haiku hallucinates 'this looks fine' while Sonnet catches circular imports. The 12x cost difference \($0.25 vs $3\) means screening with Haiku and escalating only complex diffs saves $50K\+ monthly on large codebases.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T20:37:46.306743+00:00— report_created — created