Report #78427
[cost\_intel] When does Haiku 3.5 become more expensive than Sonnet 3.5 for multi-turn agentic workflows?
Switch from Haiku 3.5 to Sonnet 3.5 when agent tasks require more than 3 tool-calling turns: Haiku's lower capability leads to 2-3x more iterations, which can push its total cost above Sonnet's despite Sonnet's higher per-turn rate.
Journey Context:
Haiku 3.5 is roughly 4x cheaper per input token than Sonnet 3.5 ($0.80 vs $3 per 1M input tokens at list price; the often-quoted $0.25 is Claude 3 Haiku's rate). For single-turn classification or Q&A, that discount is hard to beat. In agentic workflows with tool use, however, Haiku struggles with complex reasoning across multiple turns: it generates incorrect tool parameters or incomplete reasoning, which forces retry loops or human clarification. In observed multi-step research tasks, Haiku needed about 2.5x as many turns as Sonnet on average. Because each turn re-sends the accumulated conversation and tool output as input, token usage grows faster than the turn count, so by 3+ turns Haiku's cumulative token cost can exceed that of Sonnet's 1-2 turn completion, with worse end-to-end latency as well.
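The break-even argument above can be sketched as a small cost model. This is a rough illustration, not a measurement: it assumes the full history is re-sent as input on every turn, and every number in it (the per-token prices, `base_context`, `tokens_added_per_turn`, `output_per_turn`) is a hypothetical placeholder to be replaced with current list prices and token counts from your own logs. The names `Pricing`, `run_cost`, and `breakeven_turns` are made up for this example.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Pricing:
    """USD per 1M tokens. Supplied by the caller; nothing is looked up."""
    input_per_mtok: float
    output_per_mtok: float


def run_cost(turns: int, pricing: Pricing,
             base_context: int = 2_000,
             tokens_added_per_turn: int = 1_500,
             output_per_turn: int = 300) -> float:
    """Total USD for one agent run when the whole history is re-sent each turn."""
    total = 0.0
    context = base_context
    for _ in range(turns):
        total += context * pricing.input_per_mtok / 1e6
        total += output_per_turn * pricing.output_per_mtok / 1e6
        # Model output plus tool results join the history for the next turn,
        # so input tokens grow with every additional turn.
        context += tokens_added_per_turn
    return total


def breakeven_turns(cheap: Pricing, strong: Pricing, strong_turns: int,
                    max_turns: int = 50) -> Optional[int]:
    """Smallest turn count at which the cheap model's run costs more than
    the strong model's run of `strong_turns` turns, or None if never."""
    target = run_cost(strong_turns, strong)
    for n in range(1, max_turns + 1):
        if run_cost(n, cheap) > target:
            return n
    return None


# Placeholder prices only (assumption): substitute the real list prices.
cheap = Pricing(input_per_mtok=1.0, output_per_mtok=5.0)
strong = Pricing(input_per_mtok=3.0, output_per_mtok=15.0)

print(breakeven_turns(cheap, strong, strong_turns=2))
```

With these placeholder values the cheaper model becomes the more expensive one at 5 turns against the stronger model's 2. The point is the shape of the curve rather than the exact number: re-sent context makes cost grow faster than linearly in turns, so the break-even moves with the price ratio and with how much each tool call adds to the history.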
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T14:14:00.779365+00:00— report_created — created