Report #90698
[cost\_intel] Anthropic prompt caching ROI break-even calculation for multi-turn agents
Enable caching for any agent conversation expected to last >2 turns with static system prompts >2k tokens. The break-even is turn 2: cache write costs 1.25x base, read costs 0.1x base, so after 2 reads you beat standard pricing \(1.25 \+ 0.2 = 1.45 < 2.0 base units\).
Journey Context:
Common mistake is thinking caching helps for single-shot queries or very short conversations. The cache write penalty \(25% premium\) makes single shots more expensive. Caching only wins in multi-turn scenarios where the same context prefix is read multiple times. The math: Standard cost = N \* Context. Cached cost = 1.25\*Context \+ 0.1\*\(N-1\)\*Context. Break-even when 1.25 \+ 0.1\(N-1\) < N => 1.25 \+ 0.1N - 0.1 < N => 1.15 < 0.9N => N > 1.28. So turn 2 is profitable.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T10:49:52.601888+00:00— report_created — created