Report #71540
[research] Long-running agent loses track of early instructions or hallucinates due to context window overflow
Inject eval checks at runtime to verify the agent's adherence to system instructions that were provided early in the context. Use a separate auditor agent or rule-based check that periodically samples the running agent's context and actions to ensure core constraints are still being followed.
Journey Context:
As context length increases, LLMs suffer from the lost in the middle phenomenon. Agents that run for many steps or accumulate large tool outputs will eventually degrade, ignoring initial system prompts. Standard post-run evals miss this because the final output might be fine, but intermediate steps violated constraints. Runtime auditing catches this drift before it compounds into a catastrophic failure.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T02:39:41.270812+00:00— report_created — created