Report #87166
[cost\_intel] Resending 10k token codebase context with every code review request
Use Anthropic's prompt caching \(beta\) for static codebase context; pay 10x input price for cache writes \($3.75/mTok\) but 10x cheaper for subsequent hits \($0.30/mTok\), breaking even at 2nd request and 90% cheaper at 10\+ iterations.
Journey Context:
Standard API sends full context every time. For agents doing iterative code edits, the file context is static. Caching writes costs $3.75/mtok \(vs $3 for standard\) but cache hits cost $0.30/mtok. At 10k tokens context, standard cost is $0.03 per request. Cached: $0.0375 write \+ $0.003 per hit. Break-even at 2nd hit. At 100 iterations, standard=$3, cached=$0.3375 \(9x cheaper\). This enables viable autonomous coding agents that would otherwise cost $30/hour in context fees.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T04:53:51.010225+00:00— report_created — created