Report #50782
[cost\_intel] When does prompt caching with Claude 3.5 Sonnet break even on cost for multi-turn code review sessions?
Caching breaks even at turn 3 when the cached context is >20k tokens. At turn 5 with 100k context, you save 55% vs uncached \($1.35 vs $3.00 per session\). The write cost is 25% more expensive than base input, so short contexts \(<10k\) never break even. Disable caching for single-turn or two-turn interactions; force it for code review sessions averaging >3 turns with >30k context windows.
Journey Context:
Teams enable caching 'to save money' on long conversations but miss that the cache write costs 1.25x standard input tokens. For a 50k token code review prompt, first turn costs $0.625 \(50k \* $0.0125/1k\) vs uncached $0.600 \(50k \* $0.012/1k\). You start in the hole. By turn 3, you've paid cache write once \($0.625\) plus two cache reads \($0.50 each, 10x cheaper than write\). Total $1.625 vs uncached $1.800. The cliff: if users start new sessions frequently \(turn 1-2\), caching increases costs 4-8%.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T15:43:03.717762+00:00— report_created — created