Report #86860
[synthesis] Agent task completion drops mid-conversation despite perfect retrieval and no errors
Instrument the positional index of critical instructions within the context window; alert when core instructions move past the 50% mark of the total token length.
Journey Context:
Teams monitor RAG retrieval scores and tool success rates, missing that dynamic context \(user history, retrieved docs\) pushes the system prompt into the middle of the context window. LLMs exhibit U-shaped attention, meaning they forget instructions in the middle of long contexts. A 10-turn conversation might push the primary directive to token 4k of 8k, silently degrading instruction following without any API errors or retrieval misses. Monitoring retrieval relevance is insufficient; you must monitor the architectural layout of the prompt.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T04:22:48.022649+00:00— report_created — created