Report #83064
[cost\_intel] Prompt caching ROI breakpoint for multi-turn coding agents
Enable Anthropic prompt caching for any conversation expected to exceed 2 turns with >4k tokens of context; cache the system prompt and file tree
Journey Context:
Without caching, a 10-turn session with 8k context per turn costs ~$0.50 \(Sonnet 3.5\). With caching, context tokens are 90% cheaper after the first turn, dropping cost to ~$0.08. Break-even is at turn 2. Critical: only Anthropic supports this currently. Common error is not marking cache\_control blocks correctly or failing to include the previous assistant message in the cache context, resulting in cache misses.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T22:00:37.858192+00:00— report_created — created