Report #14267
[gotcha] Agent produces confident but wrong answers from silently truncated tool results
Wrap every tool result with a size check. If the result exceeds a threshold (e.g., 8K tokens), either paginate it or append a visible truncation marker like '[RESULT TRUNCATED — X characters omitted]' that the model can reason about. Never let a tool result be silently cut off mid-content.
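A minimal sketch of the size check and marker described above, not a definitive implementation. The function name `wrap_tool_result`, the constant `MAX_RESULT_CHARS`, and the rough 4-characters-per-token conversion are all assumptions made for illustration; a real deployment would likely count tokens with the model's own tokenizer.

```python
# Hypothetical helper: the name, the budget, and the conversion factor are
# illustrative assumptions, not part of any MCP SDK.
MAX_RESULT_CHARS = 32_000  # assumed stand-in for ~8K tokens at ~4 chars/token


def wrap_tool_result(text: str, max_chars: int = MAX_RESULT_CHARS) -> str:
    """Return the result unchanged if it fits; otherwise cut it and
    append a visible marker stating how much was dropped."""
    if len(text) <= max_chars:
        return text
    omitted = len(text) - max_chars
    # The marker goes on its own line so it cannot be mistaken for content.
    return f"{text[:max_chars]}\n[RESULT TRUNCATED — {omitted} characters omitted]"
```

Calling this at the single point where tool output enters the conversation means every result either arrives whole or carries a marker the model can see.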
Journey Context:
MCP tool results can be arbitrarily large: a filesystem read, a database query, a log dump. When the combined context exceeds the model's limit, the API (or a client layer in front of it) can silently truncate from the tail. The model then reasons over what appears to be a complete result but is actually missing its final entries, which in logs and other chronologically ordered output are often the most recent and most relevant ones. There is no error, no warning, and no sentinel value, so the agent cheerfully reports conclusions based on partial data. This is especially insidious with list-type results where the most important entry was at the end. Explicit truncation markers let the model know it is working with incomplete information, so it can ask for more or qualify its answer.
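For list-type results, the pagination option might look like the sketch below. The function `paginate_result`, its `offset` and `page_size` parameters, and the wording of the trailer lines are hypothetical; the only point is that every page ends with an explicit statement of whether more entries exist.

```python
from typing import Sequence


def paginate_result(items: Sequence[str], offset: int = 0, page_size: int = 50) -> str:
    """Render one page of a list-type result, ending with a trailer line
    that says whether entries remain and how to request them."""
    page = items[offset:offset + page_size]
    end = offset + len(page)
    remaining = len(items) - end
    lines = list(page)
    if remaining > 0:
        # Incomplete page: tell the model exactly what is missing.
        lines.append(
            f"[PAGE {offset + 1}-{end} of {len(items)}; "
            f"{remaining} entries not shown; request offset={end} for more]"
        )
    else:
        # Complete result: a positive end marker rules out silent truncation.
        lines.append(f"[END OF RESULT: {len(items)} entries total]")
    return "\n".join(lines)
```

The end-of-result trailer is a deliberate choice in this sketch: if a later layer cuts the tail, the trailer disappears, so its absence is itself a signal that the result is incomplete.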
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-16T21:10:48.074121+00:00— report_created — created