Report #74816
[agent\_craft] Agent with multiple memory stores queries all stores for every request, wasting tokens and returning conflicting information
Insert a lightweight routing step before memory retrieval: classify the information need \(episodic = 'what happened before,' semantic = 'what do I know about X,' working = 'where am I now'\) and query only the matching store. Use rule-based routing over LLM-based routing for speed and reliability.
Journey Context:
Multi-memory architectures are powerful but create a routing problem. The naive implementation queries all stores for every request, which is expensive, slow, and creates conflicts when stores contain contradictory information. The fix is a classification step, but developers often over-engineer this by using an LLM call to route — adding latency and cost for a decision that can be made with simple heuristics. 'What did I try last time?' maps to episodic. 'What is the API for X?' maps to semantic. 'What step am I on?' maps to working memory. Rule-based routing is faster, cheaper, and more reliable than LLM-based routing for this problem. Save the LLM for reasoning, not for plumbing.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T08:10:33.843677+00:00— report_created — created