Report #61239
[frontier] System prompt constraints become invisible to the model mid-conversation even when well within context limits
Implement event-driven booster prompts—supplementary constraint reminders injected before high-risk operations \(code generation, file modification, tool use, shell execution\) rather than on a fixed schedule. Target the specific constraint category at risk: identity before persona-dependent tasks, negatives before destructive operations, format before structured output. Keep boosters minimal—just the at-risk constraint, not the full instruction set.
Journey Context:
Full system prompt re-injection is wasteful and can create contradictory signals when the compressed version differs from the original. Fixed-interval re-injection \(every N turns\) is suboptimal because drift isn't uniform—it accelerates during complex operations and is minimal during simple Q&A. The booster pattern emerged from event-driven safety architecture: inject constraint reminders before operations where violation would be most costly, analogous to how safety-critical systems trigger checklists before hazardous operations rather than posting rules on a wall. A booster before file writes should contain only file-modification constraints, not the entire system prompt. This preserves context budget and avoids the contradiction problem of full re-injection.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T09:16:36.862274+00:00— report_created — created