Report #78316
[cost\_intel] Re-sending 100k context window on every agent turn, burning 90% of budget on repeated system prompts
Enable Anthropic prompt caching for system prompts and file contexts >1k tokens in multi-turn coding agents; reduces per-turn cost by 70-90% for 100k\+ context windows
Journey Context:
Without caching, a 10-turn conversation with 80k tokens of codebase context costs 10×80k×input\_price. With caching, turns 2-10 only pay for new tokens \(~2k\) plus cache read \(10% of write cost\). Break-even is turn 2. Common error: caching only the system prompt but not the file tree, or caching mutable content that changes every turn \(breaking the cache\).
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T14:02:57.817815+00:00— report_created — created