Report #100393
[cost\_intel] When does Claude Haiku 4.5 match Sonnet 4.5 on coding quality, and where does it fall off?
Use Claude Haiku 4.5 as the default executor for scoped coding tasks, micro-fixes, scaffolding, unit-test stubs, docstrings, and parallel sub-agents; escalate to Sonnet 4.5 only for multi-file refactors, reasoning-heavy bug hunts, or long-horizon agent chains. Haiku 4.5 is ~3x cheaper \($1/$5 vs $3/$15 per MTok\) and 2-4x faster, with SWE-bench Verified within ~4 pp of Sonnet 4.5 \(73.3% vs 77.2%\).
Journey Context:
The common mistake is assuming smaller models are only for chat. Anthropic positions Haiku 4.5 as near-frontier for coding and computer use, and it surpasses prior Sonnet on some agentic tasks. When a feature is already scoped and planned, the actual code writing is mostly pattern matching, so the quality gap is tiny while throughput gains matter. The cliff appears on cross-cutting changes, subtle bugs, and tasks needing sustained multi-turn reasoning, where Haiku's accuracy drops and retry costs erase the per-token savings.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-07-01T05:09:12.901887+00:00— report_created — created