Report #22202
[gotcha] Showing a 'Task Complete' UI state when the AI agent finishes its tool calls, without verifying the output
Always implement an automated verification step (either via code or a separate LLM call) before rendering the final success state, and show the user the verification result, not just the agent's claim of completion.
Journey Context:
In agentic UIs, the AI calls an API (e.g., 'send email') and returns a 'success' string, and the UI shows a green checkmark. However, the AI might have hallucinated the API response or misinterpreted a 404 error as success. The gotcha is trusting the agent's final text output as ground truth. The journey involves realizing that agentic loops need a 'verifier' or 'critic' stage: the UI should show success only if the actual tool output (or a verifier) confirms it, which prevents the 'AI said it did it but didn't' failure pattern.
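A minimal sketch of a code-based verifier stage, assuming the tool-execution layer records each call's raw HTTP status independently of the agent. The names `ToolCallRecord`, `Verification`, `UiState`, and `verify_run` are hypothetical and chosen only for illustration; they do not come from any particular agent framework. The agent's claim is passed through for display but never decides the state.

```python
from dataclasses import dataclass
from enum import Enum
from typing import Optional


class UiState(Enum):
    SUCCESS = "success"
    FAILED = "failed"
    UNVERIFIED = "unverified"


@dataclass
class ToolCallRecord:
    """Raw result captured by the tool-execution layer, not by the agent (hypothetical shape)."""
    tool_name: str
    status_code: Optional[int]  # None if no response was recorded for the call


@dataclass
class Verification:
    state: UiState
    detail: str  # shown to the user alongside the state


def verify_run(agent_claim: str, records: list[ToolCallRecord]) -> Verification:
    """Derive the UI state from recorded tool output; the agent's claim is display-only."""
    if not records:
        # The agent may have hallucinated the call entirely.
        return Verification(
            UiState.UNVERIFIED,
            f"Agent reported {agent_claim!r}, but no tool call was recorded.",
        )
    for rec in records:
        if rec.status_code is None:
            return Verification(UiState.UNVERIFIED, f"{rec.tool_name}: no response recorded.")
        if not 200 <= rec.status_code < 300:
            # Covers the case where the agent misread a 404 as success.
            return Verification(UiState.FAILED, f"{rec.tool_name} returned HTTP {rec.status_code}.")
    return Verification(UiState.SUCCESS, f"{len(records)} tool call(s) confirmed.")


# Usage: the agent claims success, but the recorded call returned a 404.
result = verify_run("Email sent successfully!", [ToolCallRecord("send_email", 404)])
print(result.state.value, "-", result.detail)
# prints: failed - send_email returned HTTP 404.
```

The UI would render the green checkmark only for `UiState.SUCCESS` and show `detail` in every case, so the user sees the verification result instead of the agent's text. A status-code check is the simplest possible verifier. Where a 2xx response does not prove the intended effect, a stronger check (such as reading back the created resource, or a separate LLM critic) could replace the loop body under the same interface.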
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-17T15:40:53.654065+00:00— report_created — created