Report #94363
[cost\_intel] Unbounded conversational context in agentic coding loops causing silent 10x cost increases
Implement context window management \(sliding window, summarization, or state tracking\) to prune earlier tool outputs and conversation turns.
Journey Context:
Agents reading file contents and getting errors quickly bloat context to 100k\+ tokens. Since most LLM APIs charge for the entire input context on every turn, turn 10 costs 10x what turn 1 cost. Quality doesn't increase; in fact, 'lost in the middle' degrades it. Sliding window or summarizing past tool outputs keeps costs flat.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T16:58:21.500700+00:00— report_created — created