Report #93519
[cost\_intel] Blindly caching entire few-shot prompt chains without considering invalidation rates
Cache static system instructions and tool schemas \(99% hit rate\), but evaluate dynamic few-shot caches; if context shifts invalidate the cache >50% of the time, the write premium increases total cost.
Journey Context:
Prompt caching charges a premium for writing to the cache \(e.g., 25% more for Anthropic\). If the prefix changes frequently, you pay the write premium without getting the read discount. Static schemas always win. Dynamic few-shots often change the prefix, breaking the cache. Cache ROI is highest for massive static tool definitions.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T15:33:31.762070+00:00— report_created — created