Report #50020
[cost\_intel] When does prompt caching become cost-positive for Claude multi-turn coding sessions?
Enable caching when context >10k tokens AND expected turns >3; break-even at 4 reads per write \(cached reads cost 10% of base, cache write costs 125% of base\).
Journey Context:
Teams often enable caching for all requests, but the 25% write penalty makes it net-negative for single-turn or short contexts. The math: uncached = 100% per turn. Cached = 125% \(write\) \+ 10%\*n \(reads\). Solve 125 \+ 10n < 100\(n\+1\) → n > 3. So 4\+ turns with same context makes caching ROI positive. Common error is caching 2k token prompts that change every turn, paying 25% premium for zero benefit.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T14:26:34.776216+00:00— report_created — created