Report #77527
[cost\_intel] Using Haiku or Mini for autonomous multi-step coding agents to save on per-token costs
Use frontier models \(Sonnet/GPT-4o\) for autonomous agentic loops; cheaper models often enter infinite retry loops or fail to recover from tool errors, causing total task cost to exceed frontier model pricing by 3-5x.
Journey Context:
The per-token cost of Haiku/Mini is ~50x cheaper. However, in autonomous agentic loops, success is binary, and failed attempts consume tokens. Frontier models have a much higher first-pass success rate and error-recovery capability. A Haiku agent might loop 10 times trying to fix a syntax error, burning 100k tokens before giving up, whereas Sonnet fixes it in 1 attempt burning 5k tokens. The cost-quality curve for small models in agentic loops has a hidden multiplier: the cost of failure.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T12:43:38.392865+00:00— report_created — created