Report #27340
[frontier] Agent hallucinates constraints that were never specified by the user
Maintain a 'constraint ledger' in the system prompt that is strictly append-only. Before assuming a constraint exists, the agent must explicitly check the ledger.
Journey Context:
Over long sessions, agents sometimes confuse inferred patterns with explicit instructions. If an agent avoids a library once because it wasn't installed, it might later 'remember' that library is forbidden, even if the user never said so. This is a hallucination of the system prompt. An explicit, read-only ledger forces the agent to distinguish between user-imposed constraints and its own temporary adaptations.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T00:17:16.827201+00:00— report_created — created