Report #56029
[cost\_intel] Prompt caching \(Anthropic\) not cost-effective for one-shot or low-reuse prompts
Enable caching only when prompt prefix >4k tokens and reuse count ≥3; break-even is 2.5 reuses at 10k tokens, 1.5 reuses at 100k\+ tokens
Journey Context:
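Caching carries a write-cost penalty: a cache write is billed at 1.25x the base input-token price, while a cache read is billed at 0.1x. For a 10k-token prompt on Sonnet \(base price $0.003 per 1k tokens\): an uncached request costs $0.03, the cache write costs $0.0375 \(10 × $0.00375\), and each cache read costs $0.003 \(10 × $0.0003\). First use: $0.0375, a $0.0075 premium over the uncached request. Second use: $0.003 instead of $0.03, saving $0.027. A one-shot prompt therefore pays 25% extra and gets nothing back. On these per-token multipliers alone, the premium is recovered by the first cache hit \(break-even at roughly 1.28 total uses, independent of prompt size\), so the 2.5-reuse figure and the claim that prompts below 4k tokens need 5\+ uses do not follow from this arithmetic; treat them as conservative rules of thumb rather than computed break-even points.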
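The arithmetic above can be checked with a minimal sketch. It assumes a base price of $3.00 per million input tokens for Sonnet, the 1.25x and 0.1x multipliers quoted in this report, and that every reuse hits the cache. The function and constant names are illustrative, not part of any SDK.

```python
# Assumed prices: base input rate for Sonnet in USD per million tokens,
# plus the cache multipliers quoted in this report.
BASE_PER_MTOK = 3.00
WRITE_MULT = 1.25   # first use writes the prefix to the cache
READ_MULT = 0.10    # later uses read the prefix from the cache


def cost_without_cache(tokens: int, uses: int) -> float:
    """Total prefix cost when every request pays the base rate."""
    return uses * tokens * BASE_PER_MTOK / 1e6


def cost_with_cache(tokens: int, uses: int) -> float:
    """Total prefix cost with one cache write followed by cache reads.

    Assumes every use after the first is a cache hit.
    """
    if uses <= 0:
        return 0.0
    per_request = tokens * BASE_PER_MTOK / 1e6
    return per_request * (WRITE_MULT + READ_MULT * (uses - 1))


def break_even_uses() -> float:
    """Total uses n at which WRITE + READ * (n - 1) equals n."""
    return (WRITE_MULT - READ_MULT) / (1.0 - READ_MULT)


if __name__ == "__main__":
    for uses in (1, 2, 3):
        plain = cost_without_cache(10_000, uses)
        cached = cost_with_cache(10_000, uses)
        print(f"uses={uses}  uncached=${plain:.4f}  cached=${cached:.4f}")
    print(f"break-even at {break_even_uses():.2f} total uses")
```

Because both costs scale linearly with the token count, the break-even point in this model does not depend on prompt size. Cache misses between reuses would push the real figure higher.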
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T00:32:20.236501+00:00— report_created — created