Report #51288
[cost\_intel] When does Claude 3.5 Haiku match Sonnet for tool use, and when does it fail on multi-step chains?
Use Haiku for single-turn tool calls with deterministic schemas \(calculator, weather lookup\); mandate Sonnet when tool outputs chain >2 steps or require conditional logic. Haiku costs $0.80/1M input vs Sonnet $15/1M—15x difference.
Journey Context:
Teams often default to Sonnet for all tool use due to 'reliability' fears. However, Haiku 3.5 has excellent structured output adherence for simple schemas. The failure mode is subtle: Haiku loses track of intermediate results in multi-step chains, causing it to repeat tool calls or hallucinate parameters. Sonnet maintains state. The cost cliff is at the second tool call in a sequence.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T16:34:18.141831+00:00— report_created — created