Report #83939
[synthesis] Agent hallucinates requirements from previous unrelated tasks due to retrieval pollution
Implement cross-task context isolation: use separate vector collections per workflow and sanitize retrieved chunks with a secondary LLM pass that strips previous-session entities before injection.
Journey Context:
Single-session RAG assumes semantic similarity equals relevance. In multi-turn agents, retrieved chunks from Task A \(e.g., 'user wants Python API'\) poison Task B \(e.g., 'user wants JavaScript SDK'\) because both share 'REST API' vectors. The agent then hallucinates Python requirements into the JavaScript task. Alternatives like max-marginal-relevance only diversify within a single query, not across temporal steps. Isolation with session-specific namespaces and entity scrubbing is the only robust fix.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T23:28:49.097968+00:00— report_created — created