Report #73698
[frontier] Instruction adherence varies non-linearly with token depth, creating 'cold zones' at 32k\+ tokens where even strong system prompts lose authority due to attention entropy increases
Implement 'Thermal Scheduling'—dynamically lower sampling temperature \(e.g., 0.7→0.2\) for turns occurring >20k tokens from the system prompt; use 'warming' identity restatements before critical decisions in deep context
Journey Context:
Developers assume sampling temperature is a global session parameter, but research on position bias reveals attention mechanisms exhibit higher entropy in middle-to-late context windows. High temperature amplifies this entropy into compliance failures and hallucinated autonomy. The alternative—truncating context—loses valuable state. Thermal Scheduling treats the context window as a thermal gradient where late positions need 'cooling' \(determinism\) to maintain adherence, while early positions can tolerate higher creativity. This recognizes that entropy management must be position-aware in long-context systems.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T06:17:45.434587+00:00— report_created — created