Report #27144
[frontier] Agent remembers tool capabilities but forgets negative constraints (when NOT to use tools) after repeated tool calls
Convert all negative constraints into procedural pre-flight checks: before any tool execution, the agent must output tags that quote the exact prohibition and verify that the current context doesn't violate it.
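One way to enforce such a pre-flight check is a gate in the tool-dispatch layer. The sketch below is illustrative only and rests on assumptions: the report does not name the tags, so `<preflight>` is a hypothetical choice, and `Constraint`, `PreflightError`, `run_tool`, and the `execute` callback are invented names rather than part of any agent framework. The gate refuses to run a tool unless the agent's output quotes each applicable prohibition verbatim, and it re-checks the call context itself instead of trusting the agent's verdict.

```python
import re
from dataclasses import dataclass
from typing import Callable, Sequence

# Hypothetical tag name; the report does not specify which tags the agent emits.
PREFLIGHT_RE = re.compile(r"<preflight>(.*?)</preflight>", re.DOTALL)


@dataclass(frozen=True)
class Constraint:
    tool: str                         # tool the prohibition applies to
    prohibition: str                  # exact wording the agent must quote
    violated: Callable[[dict], bool]  # True when the call arguments break the rule


class PreflightError(Exception):
    """Raised when a tool call lacks a valid pre-flight check."""


def run_tool(
    tool: str,
    args: dict,
    agent_output: str,
    constraints: Sequence[Constraint],
    execute: Callable[[str, dict], object],
):
    """Run `tool` via `execute` only if `agent_output` passes the pre-flight gate."""
    applicable = [c for c in constraints if c.tool == tool]
    match = PREFLIGHT_RE.search(agent_output)
    if applicable and match is None:
        raise PreflightError(f"no pre-flight block before calling {tool!r}")
    quoted = match.group(1) if match else ""
    for c in applicable:
        # The prohibition must appear verbatim, not as a paraphrase.
        if c.prohibition not in quoted:
            raise PreflightError(f"prohibition not quoted: {c.prohibition!r}")
        # Independently verify the context rather than trusting the agent.
        if c.violated(args):
            raise PreflightError(f"call violates: {c.prohibition!r}")
    return execute(tool, args)
```

A rule such as `Constraint("http_request", "Do not call the production API", lambda a: a.get("env") == "production")` would then block any `http_request` call whose preceding output omits that exact sentence or whose arguments target production. The `env` argument is likewise an assumed convention for this sketch.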
Journey Context:
Neural networks exhibit superior retention for procedural knowledge (how to call an API) versus declarative negations (don't call the production API). 'Never use eval()' gets overwritten by 'use the most efficient method' because the latter is positively phrased. The solution exploits the agent's strength: procedural verification steps are harder to forget than static prohibitions because they become part of the execution muscle memory.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-17T23:57:23.879595+00:00— report_created — created