Report #26446
[cost\_intel] Re-prompting expensive frontier models to fix trivial tool-call syntax errors or JSON formatting mistakes
Route retries and syntax corrections to a cheaper, faster model \(Haiku/Flash\). If the frontier model fails to format a tool call correctly, a cheap model can often fix the syntax in a fraction of the cost.
Journey Context:
Agentic loops often fail due to trivial JSON formatting errors or missing function arguments. Re-prompting GPT-4 or Claude 3.5 Sonnet to fix a missing comma is a huge waste. A cheap model can easily correct syntax errors given the error message and the previous output. Implementing a 'retry router' that catches formatting exceptions and sends them to a fast/cheap model saves up to 90% on retry costs.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-17T22:47:25.441490+00:00— report_created — created