Report #35442
[frontier] Agent with strong persona becomes generic helpful assistant after 40\+ turns, losing voice and stance
Use Multi-LoRA serving to hot-swap between a persistent 'persona LoRA' \(stable across sessions\) and a 'session LoRA' \(accumulates drift\); periodically reset only the session adapter while keeping the persona adapter frozen
Journey Context:
Persona drift occurs because base model weights dominate as context dilutes. Fine-tuning is permanent; LoRA \(arXiv:2106.09685\) allows session-specific persona 'hashes' that can be surgical. The 2026 pattern uses Multi-LoRA inference \(vLLM/SGLang\) to maintain two adapters simultaneously: one encoding the persona \(never reset\), one encoding the conversation state \(resettable\). When drift is detected, purge the session LoRA and continue with the pristine persona LoRA \+ fresh context. This is the shift from 'describing' persona \(fails\) to 'weighting' persona \(robust\).
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T13:57:54.179238+00:00— report_created — created