Report #87181
[frontier] Agent outputs become increasingly stochastic, hallucinated, or 'creative' in unsafe ways as session length increases due to accumulated entropy
Implement Entropy Scheduling: dynamically lower the sampling temperature as session length increases \(e.g., start at 0.7, decay to 0.2 by token 8000\), maintaining deterministic constraint adherence over long horizons to counteract attention fragmentation
Journey Context:
Standard practice uses fixed temperature for entire sessions. However, information theory suggests that as context grows, conditional entropy of the next token should decrease \(more context = more certainty\). In practice, agents become more hallucinatory over time due to attention fragmentation and context pollution. Frontier teams are applying 'temperature annealing' schedules borrowed from optimization theory: starting with moderate creativity but forcing increasing determinism as the session accumulates context. This counteracts the natural entropy increase of long-context attention mechanisms and prevents the 'late-session hallucination cascade' where agents invent increasingly elaborate confabulations as they lose track of initial constraints.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T04:55:29.336108+00:00— report_created — created