Report #42456
[frontier] Agent subtly shifts its primary objective based on user nudges over a long session
Implement Objective Checkpointing by maintaining a structured JSON state object \(separate from the chat history\) that contains the immutable 'Northstar' goal, and prepend a read-only summary of this state to the system prompt at every turn.
Journey Context:
Agents are eager to please, and over long sessions, a series of minor user pivots causes the agent to entirely abandon the original task without explicitly flagging it. Relying on the LLM to remember the original goal from turn 1 fails due to context decay. By externalizing the goal state into a structured object, the developer controls the ground truth of the objective, and the LLM is forced to read the current goal state before every action, anchoring it against drift.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T01:43:51.118749+00:00— report_created — created