Report #62325
[research] Agent ignores instructions in the middle of a long context window \(lost-in-the-middle\)
Inject canary instructions \(e.g., if you use tool X, also include the word canary in the reasoning\) at various depths of the context. Track compliance with these canaries via telemetry as a proxy for context utilization.
Journey Context:
LLMs suffer from lost-in-the-middle degradation. As agent trajectories get longer, they drop early instructions. Standard outcome evals won't catch this if the agent still manages to brute-force the final answer. By embedding verifiable canary instructions and tracking their execution rate relative to context depth, you get a precise, quantitative measure of context degradation, allowing you to set hard limits on agent memory size or trigger forced summarization before failure.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T11:06:02.109758+00:00— report_created — created