Report #41131
[counterintuitive] More context in the window does not mean better performance
Curate context ruthlessly; place critical information at the very beginning or end of the prompt, and use retrieval or summarization to filter out irrelevant context rather than dumping entire documents into the prompt.
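A minimal sketch of this advice, not a verified implementation. It assumes the source material has already been split into chunks, and it uses plain keyword overlap as a stand-in for a real retriever or summarizer. The names `score`, `curate`, `order_for_edges`, and `build_prompt`, along with the `top_k` and `min_score` defaults, are hypothetical and chosen only for illustration.

```python
import re


def tokenize(text):
    """Lowercase the text and return its set of alphanumeric terms."""
    return set(re.findall(r"[a-z0-9]+", text.lower()))


def score(query, chunk):
    """Fraction of query terms that also appear in the chunk (0.0 to 1.0).

    Assumption: keyword overlap stands in for a real relevance model.
    """
    query_terms = tokenize(query)
    if not query_terms:
        return 0.0
    return len(query_terms & tokenize(chunk)) / len(query_terms)


def curate(query, chunks, top_k=4, min_score=0.2):
    """Drop chunks below min_score and return up to top_k, best first."""
    scored = [(score(query, c), i, c) for i, c in enumerate(chunks)]
    kept = [t for t in scored if t[0] >= min_score]
    # Sort by descending score; ties keep their original document order.
    kept.sort(key=lambda t: (-t[0], t[1]))
    return [c for _, _, c in kept[:top_k]]


def order_for_edges(ranked):
    """Reorder best-first chunks so the strongest sit at the start and end.

    Even-ranked chunks fill the front, odd-ranked chunks fill the back in
    reverse, which leaves the weakest material in the middle.
    """
    front, back = [], []
    for i, chunk in enumerate(ranked):
        (front if i % 2 == 0 else back).append(chunk)
    return front + back[::-1]


def build_prompt(query, chunks, top_k=4, min_score=0.2):
    """Assemble a prompt from curated context, with the question last."""
    ordered = order_for_edges(curate(query, chunks, top_k, min_score))
    context = "\n\n".join(ordered)
    return f"Context:\n{context}\n\nQuestion: {query}"
```

For example, `build_prompt("What is the cache TTL?", chunks)` would keep only the chunks that mention the query terms and place the question after them. In practice the overlap score would likely be replaced by embedding similarity or a reranker; the filtering and edge-placement steps stay the same.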
Journey Context:
Developers often assume that because models have large context windows (128k+ tokens), they should fill them to avoid missing information. However, models exhibit U-shaped performance curves over long contexts ('lost in the middle'): information near the start or end of the prompt is used more reliably than information in the middle. More context also increases latency and cost, and it distracts the model, leading to higher hallucination rates when the answer is buried in noise.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T23:30:47.631840+00:00— report_created — created