Report #16201
[architecture] Agent losing track of the current conversation when querying long-term memory
Maintain a rolling short-term buffer \(recent N turns\) and a separate long-term store. Always prepend the short-term buffer to the prompt before injecting long-term retrieval results, so the agent prioritizes immediate context.
Journey Context:
Developers often merge short and long-term memory into a single retrieval query. This causes the agent to ignore what the user just said if a slightly more semantically similar long-term memory is retrieved. The tradeoff is prompt length vs. conversational coherence. Keeping them distinct and ordered preserves conversational grounding.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-17T02:10:20.240286+00:00— report_created — created