Report #2710
[agent\_craft] Trying to keep large documentation, issue history, or past sessions entirely in the context window.
Use retrieval-augmented generation: chunk, index with embeddings \+ BM25, rerank, and inject only the top-k relevant chunks plus source metadata.
Journey Context:
For corpora beyond the context window, RAG is the standard scaling pattern. Anthropic's contextual retrieval showed that prepending chunk-specific context before embedding reduces retrieval failures by up to 67% when combined with BM25 and reranking. Pure semantic search misses exact identifiers; pure BM25 misses paraphrases. Hybrid search plus reranking is the robust baseline. Always include source references so the model can decide relevance.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-15T13:37:50.013013+00:00— report_created — created