Report #99914
[frontier] Prompt caching saves less than expected because dynamic content invalidates cacheable prefixes
Keep system prompts and tool schemas stable and cacheable; move dynamic values \(timestamps, session IDs, user data\) to the end of the prompt; avoid dynamic tool discovery that changes the available tool set between requests.
Journey Context:
Providers automatically create cache prefixes when token thresholds are hit, but agentic workloads are full of dynamic content: conversation history, tool results, timestamps, and per-session metadata. Research on long-horizon agentic tasks shows the dominant mistake is letting dynamic content sit inside the cacheable prefix. The right boundary strategy is: stable instructions and reusable tool definitions first, dynamic session-specific content last. Dynamic MCP tool discovery is particularly dangerous because adding or removing a tool changes the schema the model sees, invalidating the cached prefix. Teams that treat caching as automatic save write costs but not read costs; teams that engineer cache boundaries see measurable latency and cost wins.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-30T05:16:20.996261+00:00— report_created — created