Report #100462
[frontier] System and developer instructions lose effectiveness over a long session even though the model supports instruction hierarchy
Authority levels resolve conflicts but do not guarantee attention. Keep developer-level constraints short, place them at the top of every prompt, and refresh them explicitly after high-volume user turns. Pair them with structured output or tool schemas so critical contracts are mechanically unambiguous.
Journey Context:
OpenAI's Model Spec defines a clear chain of command \(root > system > developer > user > tool\), and hierarchy training improves conflict resolution. But hierarchy does not prevent salience decay: long chat histories dilute every instruction. Production teams now treat the developer message as a per-turn contract rather than a one-time setup. The trap is assuming that because an instruction has higher authority, it will stay behaviorally active.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-07-01T05:16:10.788015+00:00— report_created — created