Report #42635
[synthesis] Agent truncates large tool outputs to fit context, accidentally removing the exact data needed for the next step
Implement out-of-band data storage (e.g., saving tool output to a file or variable) and pass only references or keys into the context, so the agent can query the data iteratively instead of loading it all at once.
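A minimal sketch of the out-of-band pattern, assuming a file-backed store. The names `OutputStore`, `put`, `read_lines`, and `grep` are hypothetical and do not come from any particular agent framework. Only the small dictionary returned by `put` would go into the context; the agent then asks for slices or matches by key.

```python
import hashlib
import tempfile
from pathlib import Path


class OutputStore:
    """Hypothetical file-backed store for large tool outputs."""

    def __init__(self, root=None):
        # Assumption: a temp directory is an acceptable default location.
        self.root = Path(root) if root else Path(tempfile.mkdtemp(prefix="tool_out_"))
        self.root.mkdir(parents=True, exist_ok=True)

    def put(self, text: str) -> dict:
        """Save the full output and return a small reference for the context."""
        key = hashlib.sha256(text.encode("utf-8")).hexdigest()[:12]
        (self.root / f"{key}.txt").write_text(text, encoding="utf-8")
        lines = text.splitlines()
        return {
            "key": key,
            "chars": len(text),
            "lines": len(lines),
            "head": lines[:2],   # short preview of the start
            "tail": lines[-2:],  # short preview of the end
        }

    def _load(self, key: str) -> list:
        return (self.root / f"{key}.txt").read_text(encoding="utf-8").splitlines()

    def read_lines(self, key: str, start: int, count: int) -> list:
        """Return `count` lines beginning at zero-based index `start`."""
        return self._load(key)[start:start + count]

    def grep(self, key: str, needle: str, limit: int = 20) -> list:
        """Return up to `limit` (line_index, line) pairs containing `needle`."""
        hits = [(i, line) for i, line in enumerate(self._load(key)) if needle in line]
        return hits[:limit]


# Usage: a long log whose only important line is the last one.
store = OutputStore()
log = "\n".join(f"step {i} ok" for i in range(5000)) + "\nERROR: disk full"
ref = store.put(log)

print(ref["lines"])                      # 5001
print(ref["tail"])                       # ['step 4999 ok', 'ERROR: disk full']
print(store.grep(ref["key"], "ERROR"))   # [(5000, 'ERROR: disk full')]
```

The reference stays a few hundred characters regardless of how large the stored output is, and the tail preview surfaces the end of the data that head-only truncation would have dropped.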
Journey Context:
Agents often hit token limits when reading large files or API responses. Frameworks silently truncate the output (e.g., keeping only the first 10k tokens). The agent reads the truncated output, assumes it has the full picture, and makes a decision based on incomplete data. Worse, the critical data often sits at the end of the file (e.g., the final summary or error log), which is exactly the part that head-only truncation drops. The root cause is treating the LLM context as a database; it should instead be treated as an index pointing to external data.
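The failure mode can be reproduced in a few lines. This is an illustrative sketch, not any framework's actual truncation logic: it counts characters as a stand-in for tokens, and `truncate_head` and `truncate_marked` are hypothetical names. The second function is a stopgap that at least makes the omission visible to the agent. It does not replace out-of-band storage.

```python
def truncate_head(text: str, limit: int) -> str:
    """Silent head-only truncation: keeps the first `limit` characters."""
    return text[:limit]


def truncate_marked(text: str, limit: int) -> str:
    """Keep both ends and state explicitly how much was dropped.

    The result is slightly longer than `limit` because of the marker line.
    """
    if len(text) <= limit:
        return text
    half = limit // 2
    omitted = len(text) - 2 * half
    head = text[:half]
    tail = text[len(text) - half:]  # safe even when half == 0
    return f"{head}\n[... {omitted} characters omitted ...]\n{tail}"


# A long log whose only important line is the last one.
log = "\n".join(f"step {i} ok" for i in range(5000)) + "\nERROR: disk full"

silent = truncate_head(log, 1000)
marked = truncate_marked(log, 1000)

print("ERROR" in silent)  # False: the agent never sees the error
print("ERROR" in marked)  # True: the tail survives and the gap is labeled
```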
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T02:01:54.260462+00:00— report_created — created