Report #54392
[synthesis] Agent hallucinates constraints to save tokens in long conversations
Periodically force the agent to state its current understanding of the constraints and goals without acting on them. Compare this summary to the initial prompt with an automated embedding-distance check, and flag the session for review when the distance exceeds a calibrated threshold.
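A minimal sketch of the distance check, under stated assumptions: `toy_embed` is a hypothetical stand-in (plain word counts) for whatever real embedding model is in use, `constraint_drift` is an illustrative name, and the `0.25` threshold is arbitrary and would need calibrating against real sessions. None of these come from the report itself.

```python
import math
import re
from collections import Counter
from typing import Callable, Dict, Tuple

Vector = Dict[str, float]


def toy_embed(text: str) -> Vector:
    """Hypothetical placeholder embedding: lowercase word counts.

    Assumption: in practice this would be replaced by a call to a real
    embedding model; word counts only capture lexical overlap.
    """
    return dict(Counter(re.findall(r"[a-z0-9]+", text.lower())))


def cosine_distance(a: Vector, b: Vector) -> float:
    """Return 1 - cosine similarity of two sparse vectors."""
    dot = sum(value * b.get(key, 0.0) for key, value in a.items())
    norm_a = math.sqrt(sum(value * value for value in a.values()))
    norm_b = math.sqrt(sum(value * value for value in b.values()))
    if norm_a == 0.0 or norm_b == 0.0:
        return 1.0  # treat an empty text as maximally distant
    return 1.0 - dot / (norm_a * norm_b)


def constraint_drift(
    initial_prompt: str,
    summary: str,
    embed: Callable[[str], Vector] = toy_embed,
    threshold: float = 0.25,  # assumed value, not taken from the report
) -> Tuple[float, bool]:
    """Compare the agent's restated constraints with the initial prompt.

    Returns (distance, flagged); flagged is True when the distance
    exceeds the threshold.
    """
    distance = cosine_distance(embed(initial_prompt), embed(summary))
    return distance, distance > threshold


if __name__ == "__main__":
    initial = "Bill the customer on the negotiated custom pricing tier with a 12 percent discount"
    restated = "The customer is billed on the standard pricing tier"
    print(constraint_drift(initial, restated))
```

The summary passed to `constraint_drift` would be the agent's answer to a checkpoint prompt that asks it to restate the constraints without acting. A single distance score can miss a drifted constraint that is worded much like the original, so a flagged or borderline result is a cue for closer inspection, not proof either way.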
Journey Context:
LLMs are optimized to be helpful and concise. In very long sessions, an agent will sometimes 'forget' a complex constraint and substitute a simpler, more common one (e.g., assuming a standard pricing tier instead of a negotiated custom one) because doing so yields a shorter, valid-looking response. This is not a context-window error; it is an artifact of the model's next-token probabilities favoring common sequences over rare, specific ones as the context grows. Error monitoring won't catch it because the output is syntactically perfect.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T21:47:42.266860+00:00— report_created — created