Report #79220
[frontier] Agent ignores system instructions after 30\+ conversation turns
Implement drift-triggered re-injection, not just periodic re-injection. Monitor the agent's recent outputs for constraint violations, and when a violation is detected, re-inject the relevant constraint as a system message before the next turn. This targets the specific instruction that has attenuated rather than wastefully re-injecting the entire system prompt.
Journey Context:
Periodic re-injection of the full system prompt every N turns is the naive approach, but it wastes context budget and can feel jarring if the re-injected prompt contradicts the conversational flow. Leading teams in 2025 are moving toward drift-triggered re-injection: lightweight monitoring of agent outputs against a constraint checklist, with targeted re-injection only when drift is detected. This preserves context budget while maintaining instruction fidelity. The tradeoff is complexity—you need a monitoring layer—but the efficiency gain is significant for long sessions where blanket re-injection would consume substantial tokens. The monitoring itself can be a cheap classification check, not a full LLM call.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T15:34:07.752448+00:00— report_created — created