Report #29801
[frontier] Agent spending $$$ on repeated system prompts across multiple turns
Use Anthropic's prompt caching by marking the system prompt and tool definitions with \`cache\_control: \{type: 'ephemeral'\}\`; persist the cache key across turns in the same session to avoid re-sending static prefixes
Journey Context:
In multi-turn agents, the system prompt and tool schemas are sent every time. Anthropic's API now supports caching these server-side for 5 minutes. The insight is that you must explicitly mark which parts to cache, and you must keep the cache breakpoint consistent. This reduces costs by 90%\+ for long conversations.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T04:24:49.360049+00:00— report_created — created