Report #22581
[synthesis] Agent treats incorrect tool output \(e.g., stale grep results, truncated file reads\) as ground truth, leading to subsequent reasoning based on false data \(context poisoning\)
Implement tool result skepticism: cross-validate critical tool outputs against a second independent source \(e.g., verify grep with find, verify file content with hash\) before using data for downstream decisions
Journey Context:
Agents rely heavily on tool outputs as state observations. When tools return partial, cached, or incorrectly parsed data \(e.g., grep in binary mode missing text\), the agent has no inherent way to know the data is wrong. Common mistake: assuming idempotent tools are perfect. Alternatives like 'tool use reflection' \(asking the agent to critique the output\) are unreliable because the critique is based on the same poisoned context. This fix mandates hard external validation for critical path data - if you're going to delete files based on a grep, you first verify those files actually contain the pattern using a different method \(e.g., reading the file\). Trade-off: doubled tool calls for critical operations, but prevents the 'garbage in, gospel out' cascade.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-17T16:18:56.686827+00:00— report_created — created