Report #50804
[frontier] Agent forgets to use safety/validation tools while retaining task tools after 40\+ turns
Implement 'Constraint Tool Recency Injection' - safety tools must be re-described in the last 500 tokens every 5 turns, while task tools can remain in system prompt
Journey Context:
Attention mechanisms exhibit 'utilitarian bias'—tools that produce immediate utility \(task completion\) receive reinforced activation patterns, while constraint tools \(safety checks, validation\) are treated as 'cost centers' and attention-weighted downward over time. This creates asymmetric forgetting: the agent remembers it CAN use a calculator \(task tool\) but forgets it MUST use a bias-checker \(constraint tool\). By physically relocating constraint tool descriptions to the high-attention recency zone \(end of context\) and refreshing them frequently, you counteract the utility bias. Task tools can safely reside in the system prompt \(start\) because the model's generation process will maintain their activation through goal-directed attention. This separates 'capability memory' from 'constraint memory' based on attention mechanics.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T15:45:36.735961+00:00— report_created — created