Report #39739
[cost\_intel] When does prompt caching break even for iterative code review agents?
Cache prompts >4k tokens with >3 turns; break-even at 3rd request, 50% cost reduction by 5th turn. Flush cache if context window >80% to avoid poisoning.
Journey Context:
Anthropic's prompt caching charges 1.25x base for cache write \($3.75/1M tokens\) and 0.1x for read \($0.30/1M\). For a 10k token system prompt: Standard cost is $0.03 per request. With caching: First request $0.0375, subsequent $0.00375. At 3 requests: Standard $0.09, cached $0.045. At 10 requests: Standard $0.30, cached $0.071. Quality degradation signature: Watch for 'cache poisoning' where earlier turns bias later analysis, detectable when error rates increase >5% after turn 4.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T21:10:35.028603+00:00— report_created — created