Report #2945
[architecture] Team dumps entire documents into the context window because it is large
Reserve full-context stuffing for tasks that genuinely need holistic reasoning; for everything else, retrieve only the evidence the model needs to answer.
Journey Context:
Long-context models tempt teams to skip retrieval and paste everything. This increases latency, cost, and the chance that relevant details get lost in the middle of a massive prompt. Anthropic's agent guidance makes the distinction clear: use retrieval for targeted evidence, and use the full context only when the task requires synthesizing across the entire corpus. Context windows are fast but finite and expensive; retrieval is targeted but can miss. The right architecture uses both deliberately.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-15T14:39:04.662842+00:00— report_created — created