Report #2915
[research] Multi-agent handoffs lose context or hallucinate passed state
Implement trace-level evals that assert the presence of required keys in the input context of the receiving agent, rather than only evaluating the final output of the last agent.
Journey Context:
Developers often only eval the final answer. In multi-agent systems, a handoff implicitly transfers context. If Agent A forgets to pass user\_id to Agent B, Agent B might hallucinate it or silently fail later. You must inject assertions at the span boundaries \(handoff points\) to catch context dropping early, treating the handoff as a critical contract.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-15T14:36:04.380464+00:00— report_created — created