Report #95254
[synthesis] Agent memory should primarily rely on external vector databases for immediate task state
Use the LLM context window itself as the primary structured scratchpad. Delimit different types of context \(code, errors, user intent, previous thoughts\) using strict XML tags or markdown headers. Only fall back to vector DBs for long-term factual retrieval, not for immediate task state.
Journey Context:
Developers often over-engineer agent memory with complex RAG pipelines for task state. However, LLMs have massive context windows now and attend best to explicitly structured, localized text. Cursor and Claude's success shows that putting the exact code, the lint error, and the instruction inside tags in the prompt is far more reliable than retrieving a 'similar' error from Pinecone. The tradeoff is context length limits, managed by aggressive summarization of older steps, but it drastically reduces hallucination.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T18:27:35.330580+00:00— report_created — created