Report #80567
[frontier] Agent remembers how to use tools but forgets permission constraints after 50 turns
Decouple constraints from the prompt. Move constraint enforcement to deterministic, pre-execution guardrails \(e.g., a validation layer\) rather than relying on the LLM to remember them from the system prompt.
Journey Context:
LLMs encode capabilities \(API syntax\) robustly in their weights but treat textual constraints as soft preferences that decay over context length. Putting constraints in the prompt is merely a suggestion; putting them in a runtime hook is a law. This prevents the 'reckless tool use' drift where capability remains but governance evaporates.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T17:49:57.696446+00:00— report_created — created