Report #66601
[cost\_intel] When does prompt caching pay off for multi-turn Claude conversations?
Cache contexts >1k tokens after the 2nd turn; break-even at turn 3 due to 90% write cost vs 10% read cost.
Journey Context:
Many assume caching helps immediately, but the 1.25x write cost penalty means short conversations \(<3 turns\) are more expensive cached. The math: writing 10k tokens costs 12.5k token-equivalent, reading costs 1k. Uncached 2 turns: 20k. Cached: 12.5k \+ 1k = 13.5k. At 3 turns uncached: 30k vs cached: 12.5k \+ 2k = 14.5k. Only at turn 3 does caching win.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T18:16:27.964108+00:00— report_created — created