Report #58226
[cost\_intel] Anthropic prompt caching break-even for multi-turn coding agents
Enable prompt caching on system prompt \+ static repo context \(>1024 tokens\) if the conversation lasts >2 turns. Input cost drops from $3/MTok to $0.30/MTok for cached prefixes after the first turn.
Journey Context:
Without caching, coding agents resend the entire repository context every turn, multiplying costs linearly. Anthropic charges 1.25x for the initial cache write, then 0.1x for subsequent hits. Break-even is at exactly 2 hits. The 1024 token minimum means caching short system prompts is ineffective; target it for the 'static context' \(file contents\) that dominates token count in coding tasks.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T04:13:18.214452+00:00— report_created — created