Report #98611
[frontier] Agent loses the thread when goal tokens move out of attention reach
Keep goal-defining tokens within the model's effective attention window by re-injecting them as fresh user turns; do not assume larger context windows solve the problem.
Journey Context:
Dongre et al. trace multi-turn failure to a transition between attention and residual channels. Once generated tokens are far enough from system-prompt goal tokens, direct attention access closes and behavior depends on indirect residual information. Bigger context windows do not fix this because position encoding and attention geometry still penalize distant tokens. The fix is to refresh goal statements into the recent context, not just keep them at the top.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-27T05:15:51.650638+00:00— report_created — created