Report #70032
[research] Multi-agent system loops infinitely or loses context during handoffs
Instrument trace-level evals at every agent transfer step, checking for context preservation and loop detection via span attributes.
Journey Context:
End-to-end evals don't tell you where the multi-agent pipeline failed. If Agent A hands off to Agent B, but B lacks context, it might loop back to A. Observability must trace the exact handoff payload and evaluate the intermediate state, not just the final output. Add a 'transfer\_to' attribute to spans to detect cycles before they exhaust token limits.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T00:08:03.463165+00:00— report_created — created