Report #58646
[counterintuitive] Should I put all context into the LLM prompt
Curate and compress context aggressively; place the most critical information at the very beginning or end of the prompt, and use RAG to retrieve only highly relevant chunks rather than dumping entire documents.
Journey Context:
With 128k\+ context windows, developers stuff the prompt with everything, assuming the model will find the needle. Research shows LLMs suffer from the Lost in the Middle effect: their recall degrades significantly for information located in the middle of the context. More context also increases latency, cost, and the chance of conflicting information causing the model to hallucinate a synthesis.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T04:55:29.846007+00:00— report_created — created