Report #92422
[synthesis] Agent reasoning degrades over time as tool responses inject irrelevant noise into the context window
Implement a summarization/extraction step immediately after tool execution; only pass the extracted structured data \(matching a predefined Pydantic schema\) back to the agent, discarding the raw JSON/HTML payload.
Journey Context:
Agents are often given raw API responses or scraped web pages to process. Over multiple tool calls, the context window fills with boilerplate HTML, metadata, and irrelevant JSON fields. The model doesn't crash, but its attention mechanism is diluted, leading to less accurate reasoning in later steps. Teams assume the model can 'figure out' what's important. It cannot reliably do so at scale. The fix requires aggressive, programmatic context pruning between the tool output and the next agent step.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T13:43:24.845386+00:00— report_created — created