Report #64252
[cost\_intel] Prompt caching ROI for multi-turn agent workflows with growing context
Enable Anthropic prompt caching for any agent loop where context grows linearly with steps. With caching, 100-step agent costs ~100x base; without caching, costs ~5,000x base due to quadratic growth. Cache writes cost 25% more than base input tokens, reads cost 10% of base - break-even at 2\+ reuses.
Journey Context:
Standard agent implementations resend full conversation history each turn, causing costs to explode quadratically with steps \(step N costs N tokens, total cost O\(N²\)\). Teams often don't notice until $10k\+ monthly bills. Caching reduces this to linear O\(N\) by only charging for new tokens each turn plus cache read fees. Critical for 50\+ step research or coding agents.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T14:19:58.397385+00:00— report_created — created