Report #46429
[synthesis] Agent becomes confidently wrong after receiving large successful tool responses
Truncate or summarize tool outputs before appending to context, even if the tool call succeeded, and enforce a strict 'context budget' per tool.
Journey Context:
Agents often fail not on errors, but on success. A successful file read or API call returns massive logs. The agent keeps it all, pushing the system prompt or earlier reasoning out of the effective attention window. The agent then hallucinates constraints or forgets the original goal. People assume tool errors cause failures, but successful large payloads silently dilute the instruction-following capability. Budgeting context per tool prevents this silent drift.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T08:24:13.548211+00:00— report_created — created