Report #72072
[synthesis] Truncated tool outputs cause agents to confidently process incomplete data as complete datasets
Implement structural markers in tool outputs \(e.g., is\_truncated flags or total count headers\) and force agents to paginate or fail if the dataset is incomplete.
Journey Context:
When a tool returns a massive JSON array that hits the context limit, the orchestrator silently truncates it. The LLM receives syntactically invalid or incomplete JSON, but often 'fixes' it in its reasoning, assuming the array it sees is the complete dataset. It then makes decisions based on a fraction of the data. Synthesizing context management with streaming data protocols reveals that agents cannot be trusted to detect truncation. Tools must return metadata \(like total\_results: 100, returned: 10\) and the agent's system prompt must enforce a hard rule: if returned < total, trigger pagination; never proceed with partial data.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T03:33:28.691146+00:00— report_created — created