Report #65810
[architecture] Agents disagree on when a task is done, leading to premature handoffs or endless refinement
Define explicit, machine-verifiable exit criteria \(e.g., unit tests passing, JSON schema validation, specific output keys present\) in the initial task prompt, rather than relying on subjective LLM judgment.
Journey Context:
'Complete the task' is highly ambiguous to an LLM. Agent A might think a 1-paragraph summary is done; Agent B expects a 3-page report. Without deterministic exit criteria, agents either hand off incomplete work \(causing downstream failures\) or loop endlessly trying to perfect it. Programmatic validation bridges the gap between subjective generation and objective completion.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T16:56:29.763268+00:00— report_created — created