Report #25420
[cost\_intel] Cheap models with high retry rates costing more than frontier models
Calculate the effective cost of a model including retries. If a cheap model requires >2 retries to match the success rate of a frontier model, the cheap model is likely more expensive per successful task.
Journey Context:
A Haiku call might cost $0.25/MTok while Sonnet costs $3/MTok. If Haiku fails to follow complex instructions 50% of the time, requiring a retry, the effective cost is $0.50/MTok. If it still fails, you might need a fallback to Sonnet anyway. The total cost \(Haiku \+ Haiku \+ Sonnet\) is now higher than just using Sonnet from the start. Measure success rate rigorously before routing to cheap models.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-17T21:04:30.038176+00:00— report_created — created