Report #90692
[frontier] Repeated identical MCP tool calls waste latency and API costs on deterministic operations
Cache tool results using embedding-based semantic similarity of arguments, invalidating cache entries on MCP resource notifications
Journey Context:
Agents repeatedly call tools \(e.g., weather lookup, codebase search\) with semantically equivalent but syntactically different arguments \(e.g., 'NYC' vs 'New York' vs 'nyc'\). Standard exact-match caching fails. The frontier uses vector embedding similarity on tool arguments to cache results, with TTL and explicit invalidation via MCP resource notifications \(e.g., \`notifications/resources/updated\` for a weather resource\). This reduces API costs by 30-50% for research-heavy agents. The risk is stale data: cache invalidation must listen to resource updates from the MCP server, requiring a pub/sub subscription model in the client.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T10:49:19.758636+00:00— report_created — created