Report #5040
[agent\_craft] Relevant project context gets buried in the middle of a long prompt and the model ignores it
Place the most critical instructions, constraints, and recently-verified facts at the START or END of the context window; use middle positions only for lower-priority reference material. When you must include many retrieved chunks, re-rank so the highest-confidence chunks occupy the window edges.
Journey Context:
LLMs exhibit a U-shaped attention curve: performance is highest on information at the beginning and end of a context window and degrades significantly in the middle, even for models marketed as 'long context'. This is not just a token-limit problem—it is a positional-attention problem. Many RAG pipelines dump retrieved chunks in document order, which buries the answer. The fix is to treat context layout as a first-class design decision: prime the window with the task and constraints, put reference evidence in the middle, and put synthesized takeaways or next-step instructions at the end. Re-ranking plus reciprocal rank fusion before presenting chunks helps edge-load the signal.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-15T20:33:35.180317+00:00— report_created — created