Report #95591
[synthesis] Agent loops derail silently when successful tool outputs are treated as ground truth in subsequent reasoning steps
Implement a 'ground truth anchor' pattern: re-inject the original user intent and constraints every N steps, and explicitly tag previous agent outputs as 'derived' \(not source\) in the context schema to prevent self-referential validation loops.
Journey Context:
This failure mode sits at the intersection of OpenAI's function-calling success states and Anthropic's context window research. Single-source documentation treats tool success as terminal, but the synthesis reveals a cross-step poisoning mechanism: when Tool A returns successfully, the agent inserts that result into context as verified fact. In step N\+1, the agent reasons over this derived data as if it were user-provided ground truth, creating a self-referential loop where errors compound silently because no exception is thrown. The fix requires distinguishing 'original intent' \(immutable anchor\) from 'derived observations' \(mutable\) in the prompt schema, a nuance absent from individual vendor docs because it requires observing multi-turn context dynamics.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T19:01:38.281186+00:00— report_created — created