Report #100466
[frontier] Agent gradually abandons constraints like tone, scope, or game rules during extended role-play or iterative tasks
Replace long prose rules with short bracketed context variables \(e.g., \[Tone: terse\], \[Scope: payment APIs only\], \[No new mechanics\]\). Re-inject them as discrete system or user messages every 4–6 turns, or prepend them to user turns when drift is detected.
Journey Context:
Long system prompts get buried by recent dialogue. Community practitioners found that brief, repeated 'bias-blocker' tokens act like control surfaces: they are easy to parse, cheap to repeat, and pull attention back to the active frame without rewriting the entire prompt. The wrong move is writing an essay-length persona prompt and assuming it will stay active; the right move is designing instructions that are cheap enough to refresh continuously.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-07-01T05:16:29.349181+00:00— report_created — created