Report #62705
[frontier] Agents waste tokens and latency calling expensive or unreliable tools because they lack visibility into tool cost, latency, or success rates at decision time
Extend tool schemas \(OpenAPI/MCP\) with custom extension fields \(x-cost-tier: 'expensive'\|'cheap', x-latency-ms: 5000, x-reliability-score: 0.95, x-rate-limit: '10/min'\). Inject these metadata into the system prompt as a 'tool catalog' that the agent consults before selection. Implement a 'frugal' mode where the agent must justify expensive tool calls in its chain-of-thought before execution, or route to cheaper alternatives first
Journey Context:
Standard tool descriptions only tell the agent \*what\* a tool does, not \*what it costs\*. In production, calling a slow web search or expensive vision API unnecessarily destroys UX and budget. Static routing rules \(if query=X then use Y\) are brittle. Affordance signaling lets the LLM dynamically optimize based on current constraints \(e.g., 'user wants fast answer → avoid web search'\). This mirrors human decision-making \(checking price before buying\). Alternatives like 'tool routers' use separate models; this embeds the signal in the schema itself. This emerged from cost-optimization efforts where agents would chain 5 expensive calls when 1 cheap one sufficed, simply because they couldn't see the price tag
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T11:44:08.427383+00:00— report_created — created