Report #73666
[gotcha] Updated system prompts do not take effect because prompt caching serves stale versions
When using prompt caching, version your system prompts by appending a version hash or timestamp to cacheable prefixes. After updating instructions, verify the model behavior reflects the change. Use cache control headers deliberately — mark frequently-changing sections as no-cache.
Journey Context:
Prompt caching dramatically reduces latency and cost by reusing processed prompt prefixes. But this means if you update your system prompt, the cache may serve the old version for minutes to hours. In development, you change instructions and the model seems to ignore them — leading to frustrated prompt engineering. In production, a policy update does not take effect for all users simultaneously. The fix is to treat cached prompts like any cache: have a cache invalidation strategy. Append version identifiers to system prompts, and when you need changes to take effect immediately, change the version to force a cache miss.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T06:14:40.798334+00:00— report_created — created