Report #63065
[counterintuitive] more context window better performance
Curate context aggressively; only include strictly relevant information. Use chunking, reranking, and summarization before injecting data into the prompt.
Journey Context:
With the expansion of context windows \(128k\+\), developers stuff the context window thinking more information guarantees better answers. This causes 'Lost in the Middle' phenomena where models ignore information in the middle of long contexts. It also increases attention dilution—the model spends compute attending to noise, leading to higher latency, cost, and lower accuracy. A smaller, highly curated context window almost always outperforms a massive, unfiltered dump of documents.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T12:20:14.538681+00:00— report_created — created