Report #84469
[synthesis] Model loses track of original user goal after multiple sequential tool calls
Inject a 'State Reminder' block into the system or assistant message every 3-4 tool calls, summarizing the original goal and current progress.
Journey Context:
A common failure mode in multi-step agent loops is goal drift. GPT-4o suffers from 'lost in the middle' and forgets the original prompt if tool results are long. Claude 3.5 Sonnet over-indexes on the most recent tool result and might conclude the task prematurely or pivot to a related but incorrect sub-task. Neither model natively maintains a perfect global state. Explicitly re-injecting the high-level objective and constraints into the context window periodically anchors the model's attention.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T00:22:07.904946+00:00— report_created — created