Report #49458
[cost\_intel] When does Claude prompt caching increase costs despite 90% read discounts?
Only cache prompt prefixes with >80% hit rates. The 1.25x write cost requires at least 4 subsequent reads to break even versus uncached. Never cache dynamic elements \(timestamps, session IDs, user names\) in the prefix—each unique write costs 25% more than base with zero cache hits.
Journey Context:
Teams enable caching globally seeing '90% cheaper reads' but ignore the 1.25x write penalty. A prefix with 50% hit rate \(write once, read once\) costs 1.25 \+ 0.1 = 1.35x vs 2.0x uncached—still saving, but if the prefix is unique per user \(session ID\), you pay 1.25x vs 1.0x, burning 25% extra per request. At 1M requests/day with unique prefixes, that's $1250/day wasted.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T13:30:08.473431+00:00— report_created — created