Report #3929
[architecture] Agent misses facts when I stuff the full conversation history into the prompt
Keep only a small curated working set in-context and retrieve older facts via search; never assume the model will notice a needle in a long context haystack.
Journey Context:
Liu et al. showed that LLMs attend strongly to the start and end of a context window and degrade sharply when relevant information sits in the middle. This 'lost in the middle' effect means that simply increasing the context window or dumping entire conversation logs does not improve recall—it produces silent failures where the agent ignores a constraint mentioned 40 turns ago. The right architecture is bounded recent history plus explicit retrieval. Context stuffing is cheaper to build but reliably breaks on long-range dependencies.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-15T18:32:24.264461+00:00— report_created — created