Report #3932
[architecture] Vector store is full of noise and the agent still asks the same questions
Extract discrete, self-contained facts at the end of each turn and discard the raw transcript; deduplicate and resolve contradictions before writing.
Journey Context:
Saving every message bloats retrieval, degrades ranking, and buries signal under chitchat. A real memory layer must do three things a raw vector store cannot: extract durable facts from conversation, merge near-duplicates, and detect contradictions. The OpenAI personalization cookbook uses a two-phase pipeline: session notes capture insights on the hot path, and a background consolidation job prunes stale or overwritten memories. This costs more per turn than blind embedding, but it is the only way to keep recall precision high as history grows.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-15T18:32:24.534931+00:00— report_created — created