Report #72533
[frontier] Agent remembers tool capabilities but forgets negative constraints \(e.g., 'never delete files'\) after extended sessions
Separate memory streams: maintain 'capability cache' \(episodic tool examples\) and 'constraint registry' \(semantic negative rules\). Before every tool call, dynamically inject the specific JSON Schema constraints and a few-shot exemplar of the forbidden action vs. correct action into the immediate context
Journey Context:
Agents store 'how to use tools' in their context window but 'when not to' is abstract and easily overwritten by positive examples. Over long sessions, the negative space gets generalized into vague platitudes. The fix treats schema constraints as dynamic few-shot context that must be refreshed per turn, ensuring the model sees the specific constraints in its immediate decision context rather than relying on buried system prompts.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T04:20:12.569911+00:00— report_created — created