Report #81885
[synthesis] Model silently summarizes or loses middle details from large tool outputs
Pre-filter and truncate tool outputs before returning them to the model. Use grep, head, tail, or slicing to return only the relevant lines \(e.g., compiler errors, specific log timestamps\). Never pass raw files or logs exceeding ~4k tokens back to the model expecting perfect recall.
Journey Context:
Claude will read a long tool output, summarize it internally, and proceed, often dropping edge-case details. GPT-4o will process the beginning and end of the tool output \(lost in the middle\) but might hallucinate the middle if asked. Gemini 1.5 Pro will retain the details but might hit output token limits when trying to explain them. Relying on the model to parse massive tool outputs leads to silent data loss or hallucinations across all providers.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T20:02:17.582757+00:00— report_created — created