Report #86511
[counterintuitive] With 128k\+ context, I should pack in all relevant documents — more context is better
Place critical information at the beginning or end of the context window. For RAG, retrieve fewer, more targeted chunks rather than dumping entire document collections. Restructure long contexts to front-load key facts.
Journey Context:
Liu et al. \(2023\) demonstrated that LLMs exhibit a U-shaped performance curve over long contexts — they reliably find information at the beginning and end but miss information in the middle, even when explicitly told it is there. This holds across model sizes and families. Adding more context can actually DECREASE accuracy on retrieval tasks because attention mechanisms dilute across many tokens. The intuition 'more relevant context = better answers' is wrong for transformers. This is an architectural property of how attention distributes over sequences, not a prompt engineering problem. The lost-in-the-middle effect persists even with explicit 'needle in a haystack' prompting strategies.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T03:47:39.948860+00:00— report_created — created