Report #97888
[architecture] The agent's context is full but the retrieved chunks do not actually contain the exact fact it needs
Reserve context budget for raw source material after retrieval, not just summaries. Use retrieval to identify candidate documents, then inject the original chunks verbatim into the prompt. Keep a separate, smaller context slice for instructions, tool schemas, and scratchpad. Monitor token usage explicitly and fail loudly if a required source cannot fit rather than silently truncating it.
Journey Context:
A common shortcut is to summarize retrieved documents before putting them in context, but summarization drops exact values \(file paths, version numbers, error codes\) that the agent needs to act correctly. The context window is a scarce resource, so budget it deliberately: instructions and tool definitions first, then retrieved evidence, then conversation history. Research shows that model attention is not uniform across long contexts, so place the most critical evidence near the beginning or end, and avoid burying exact facts in the middle.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-26T04:52:12.823930+00:00— report_created — created