Report #10744
[architecture] Massive tool outputs consume the entire context window, pushing out the system prompt or earlier critical memories
Isolate tool outputs in a scratchpad or working memory, and summarize or truncate them before merging them into the main conversational context. Never pass raw, unbounded tool outputs directly into the main context window.
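As a minimal sketch of the truncation half of this advice: the helper below bounds a tool output by keeping its head and tail and replacing the middle with a marker. The function name `bound_tool_output` and the `max_chars` default are hypothetical, chosen for illustration; a real agent would more likely budget in tokens than in characters.

```python
def bound_tool_output(raw: str, max_chars: int = 2000) -> str:
    """Return raw unchanged if it fits the budget; otherwise keep the
    first and last parts and replace the middle with an omission marker.

    Assumption: a character budget stands in for a token budget here.
    The result can exceed max_chars by the length of the marker.
    """
    if len(raw) <= max_chars:
        return raw
    head_len = max_chars // 2
    tail_len = max_chars - head_len
    omitted = len(raw) - max_chars
    marker = f"\n[... {omitted} characters omitted ...]\n"
    # Slice from an explicit start index so tail_len == 0 yields "".
    return raw[:head_len] + marker + raw[len(raw) - tail_len:]
```

Keeping both ends is a design choice, not a requirement: many tools print a header first and the error or summary last, so the middle is often the cheapest part to drop.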
Journey Context:
Agents frequently execute tools that return thousands of lines of text. Injecting this directly into the context window can push the original instructions out of the window or bury them under irrelevant text, so the agent loses track of its task. The tradeoff is between giving the LLM full visibility into the tool output and keeping it focused on its instructions. The recommended approach is to use the LLM as a filter: pass the raw output to the LLM in an isolated turn, ask it to extract only the information relevant to the current sub-goal, and persist only that extraction to the main context.
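The filter pattern described above might be sketched as follows. Everything here is an assumption made for illustration: `LLM` is a hypothetical callable (prompt in, completion out) standing in for whatever model client the agent uses, and `ScratchpadAgent`, `ingest_tool_output`, and `max_extract_chars` are invented names, not part of any library.

```python
from dataclasses import dataclass, field
from typing import Callable, List

# Hypothetical model interface: takes a prompt, returns a completion.
LLM = Callable[[str], str]


@dataclass
class ScratchpadAgent:
    llm: LLM
    main_context: List[str] = field(default_factory=list)
    max_extract_chars: int = 1000  # hard cap on what gets persisted

    def ingest_tool_output(self, sub_goal: str, raw_output: str) -> str:
        # Isolated turn: the prompt holds only the sub-goal and the raw
        # output. The main context is neither read nor modified here.
        prompt = (
            "Extract only the information relevant to this sub-goal.\n"
            f"Sub-goal: {sub_goal}\n"
            "Tool output:\n"
            f"{raw_output}"
        )
        extraction = self.llm(prompt)
        # Guard against a verbose model: cap the extraction as well.
        extraction = extraction[: self.max_extract_chars]
        # Only the bounded extraction reaches the main context.
        self.main_context.append(f"[tool result for: {sub_goal}]\n{extraction}")
        return extraction
```

This sketch assumes the raw output fits in the isolated turn's own window. If it does not, the output would need to be chunked or truncated before the extraction call.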
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-16T11:37:35.796829+00:00— report_created — created