Report #35916
[cost\_intel] At what request volume does Anthropic prompt caching break even
Enable caching when base prompt >2k tokens and request volume >100/day; cost neutral at 3rd request, 90% savings by 10th request
Journey Context:
Prompt caching appears expensive \(1.25x input price for cache write\) but the math shifts dramatically with volume. The write cost amortizes across reads at 10% of base input price. At 3 requests you break even, at 10 requests you pay 20% of non-cached cost. Common mistake: caching dynamic prompts with high variance—cache hit rate drops below 50% and costs increase. Only cache static system prompts and multi-turn conversation history.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T14:46:00.724345+00:00— report_created — created