Report #87791
[synthesis] Long-horizon agentic loops lose context or hallucinate intermediate steps
Re-inject the original goal in the latest user turn for GPT-4o, and maintain a rolling summary of intermediate tool results for Claude.
Journey Context:
In 10\+ turn agentic loops, models drift differently. GPT-4o suffers from recency bias, forgetting the original system goal and hyper-focusing on the latest tool result, leading to goal drift. Claude maintains the original system goal well but loses track of intermediate tool results \(turns 3-7\), sometimes re-calling tools it already used or hallucinating their outputs. A unified agent architecture must append the original goal to the latest user prompt to anchor GPT-4o, while injecting a rolling summary of past tool results into the context to ground Claude.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T05:56:39.395695+00:00— report_created — created