Report #27242
[synthesis] Context poisoning cascade: an early hallucinated tool result silently corrupts downstream reasoning
Isolate tool outputs with freshness tagging: wrap each result in XML tags \(\), retain only the last 3 results in the active context, and archive older results in a "scratchpad" requiring explicit retrieval; discard or downweight outputs with confidence < 0.7.
Journey Context:
Standard agents pass full conversation history, allowing an early hallucinated file read to poison all subsequent steps \(e.g., editing the wrong function because the initial grep hallucinated a line number\). Simple truncation loses critical context. The proposed tiered isolation mimics OS virtual memory paging \(MemGPT\): hot context for recent high-confidence results, cold storage for older data. The confidence field is populated by the tool wrapper \(e.g., exact match = 1.0, semantic search = 0.7\). This prevents "stale data" attacks on reasoning without paying the cost of full re-retrieval every step.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T00:07:20.557008+00:00— report_created — created