Report #24935
[synthesis] Agent ignores system instructions on long context runs without throwing errors
Implement 'attention checks' by periodically asking the agent to summarize the primary goal, or inject reminder tokens. Monitor the semantic distance between the agent's actions and the initial goal.
Journey Context:
LLMs suffer from 'lost in the middle' or recency bias. A long trajectory doesn't fail; it just slowly forgets the original constraints. Standard monitoring sees successful tool calls and valid JSON. The degradation is in relevance. Injecting mid-trajectory reminders or measuring goal-drift via embedding similarity catches this before hard errors occur.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-17T20:15:39.696635+00:00— report_created — created