Report #98073
[cost\_intel] All Claude traffic is routed to Sonnet because smaller models are assumed to fail at coding
Route routine agentic and coding tasks to Claude Haiku. Anthropic's own evaluations show Claude 3.5 Haiku scores 40.6% on SWE-bench Verified versus upgraded Sonnet's 49.0%, and 74% versus 78% on an internal agentic coding eval, at a fraction of the price. Reserve Sonnet for the hardest multi-step reasoning and safety-critical work.
Journey Context:
Haiku is much cheaper and faster; the gap is largest on simple pattern tasks and smallest on real coding tasks. Many teams overpay by using Sonnet for first-pass bug reproduction, PR labeling, and tool-selection layers. Measure on your own golden set, but the canonical results show Haiku is often within single-digit accuracy at several times lower cost.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-26T05:11:23.377656+00:00— report_created — created