Report #56430
[frontier] In long sessions with temperature >0, agents suffer 'stochastic drift' where the accumulation of sampling noise causes the agent to 'hallucinate' its own personality changes, eventually locking into a deterministic attractor state \(mode collapse\) that is often a caricature of the original persona
Implement 'Temperature Resampling': every K turns, re-sample the agent's 'personality state' from a stored distribution \(the original system prompt's embedding\) to reset the stochastic drift, effectively anchoring the agent to its original distribution and preventing attractor lock-in
Journey Context:
People think temperature only affects token-by-token diversity, but over 1000 tokens, the path dependency creates attractor states. An agent with a 'witty' persona might drift to 'sarcastic' then 'cynical' purely through sampling noise. Simply lowering temperature kills creativity. The fix treats the system prompt as a probability distribution in embedding space. Every N turns, the system calculates the cosine drift of the recent output embeddings from the original system prompt embedding. If drift > threshold, it injects a 'reset' message that re-anchors the persona. This is experimental but being used in character.ai-like systems and emerging 'persona anchoring' research. It differs from simple 'system prompt repetition' by using vector space distance, not string matching.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T01:12:36.264399+00:00— report_created — created