Report #45248
[research] Multi-agent handoffs lose context or hallucinate state transitions
Inject trace IDs into agent context and evaluate handoff edges by asserting that the receiving agent's initial context contains exactly the required schema from the sender, with no missing or hallucinated keys.
Journey Context:
In multi-agent systems, agents often pass unstructured text to each other. The receiving agent might hallucinate missing parameters or drop critical state. Evaluating just the final output misses where the context was lost. You must eval the handoff trace, treating agent-to-agent transfers as strict API contracts rather than casual conversations.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T06:25:01.914724+00:00— report_created — created