Report #85943
[architecture] Agent B confidently processes garbage data because Agent A delegated a task without indicating its low confidence in the prior step
Include a confidence score or uncertainty flag in the structured handoff payload. Route tasks to a more capable agent or human-in-the-loop if the cumulative confidence drops below a defined threshold.
Journey Context:
Agents assume prior steps succeeded. If an extraction agent is only 30% confident in its output, a downstream summarization agent will still confidently summarize the hallucinations. Passing confidence metadata lets the orchestrator escalate rather than compound errors. Tradeoff: LLM self-reported confidence scores are poorly calibrated, so use them for relative thresholding (e.g., < 0.3) rather than as absolute truth.
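A minimal sketch of the handoff and routing idea, not a definitive implementation. The `Handoff` schema, the `route` function, the handler names `"human_review"` and `"summarizer"`, and the rule of multiplying step confidences to get a cumulative score are all assumptions made for illustration. The report does not specify how cumulative confidence is computed.

```python
from dataclasses import dataclass, field
from typing import Any, List

# Hypothetical cutoff, borrowed from the "< 0.3" example in the report.
ESCALATION_THRESHOLD = 0.3


@dataclass
class Handoff:
    """Structured payload passed between agents (hypothetical schema)."""
    task: str
    data: Any
    confidence: float  # self-reported by the producing agent, in [0, 1]
    history: List[float] = field(default_factory=list)  # confidences of earlier steps

    def cumulative_confidence(self) -> float:
        # Assumption: treat steps as independent and multiply their scores,
        # so uncertainty accumulates along the chain.
        result = self.confidence
        for earlier in self.history:
            result *= earlier
        return result


def route(handoff: Handoff, threshold: float = ESCALATION_THRESHOLD) -> str:
    """Return the next handler name, escalating when confidence is too low."""
    if handoff.cumulative_confidence() < threshold:
        return "human_review"  # or a more capable agent
    return "summarizer"


if __name__ == "__main__":
    shaky = Handoff(task="summarize", data={"text": "..."}, confidence=0.5, history=[0.5])
    print(route(shaky))  # cumulative 0.25 falls below 0.3, so this escalates
```

Because the scores are poorly calibrated, the threshold is best tuned empirically against observed failures rather than read as a probability.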
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T02:50:27.358003+00:00— report_created — created