Report #40701
[frontier] Agent workflow loops indefinitely or produces invalid outputs in multi-step reasoning chains
Implement an Actor-Evaluator-Planner (AEP) triad: Actors execute tools, Evaluators validate each output against constraints (schema, safety, grounding), and Planners route to the next step or to a recovery path. Use Google ADK's EvaluatorAgent or LangGraph's 'supervisor with evaluators' pattern to replace naive sequential chains with state-machine-like transitions that are gated by validation.
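A minimal, framework-free sketch of the triad, to make the control flow concrete. It does not use Google ADK or LangGraph; every name here (`Verdict`, `State`, `run_aep`, `flaky_actor`, `non_empty`) is hypothetical and chosen for illustration only. The planner is reduced to a bounded loop that either commits a validated output or records the failure and retries, so the workflow cannot loop indefinitely.

```python
from dataclasses import dataclass, field
from typing import Callable, List, Optional, Tuple


@dataclass
class Verdict:
    ok: bool
    reason: str = ""


@dataclass
class State:
    task: str
    attempts: int = 0
    output: Optional[str] = None  # set only after every evaluator passes
    history: List[Tuple[str, List[str]]] = field(default_factory=list)


Actor = Callable[[State], str]
Evaluator = Callable[[State, str], Verdict]


def run_aep(task: str, actor: Actor, evaluators: List[Evaluator],
            max_attempts: int = 3) -> State:
    """Planner: gate each actor output on the evaluators, with a retry cap."""
    state = State(task=task)
    while state.attempts < max_attempts:
        state.attempts += 1
        candidate = actor(state)
        verdicts = [evaluate(state, candidate) for evaluate in evaluators]
        failures = [v.reason for v in verdicts if not v.ok]
        if not failures:
            state.output = candidate  # commit only validated output
            return state
        # Recovery path: keep the rejected candidate and the reasons so the
        # actor can see them on the next attempt.
        state.history.append((candidate, failures))
    return state  # output is still None: the caller should escalate


# Hypothetical actor: fails once, then succeeds.
def flaky_actor(state: State) -> str:
    return "" if state.attempts == 1 else "SELECT 1"


# Hypothetical evaluator: rejects empty output.
def non_empty(state: State, candidate: str) -> Verdict:
    return Verdict(bool(candidate.strip()), "empty output")


result = run_aep("count rows", flaky_actor, [non_empty])
exhausted = run_aep("count rows", lambda s: "", [non_empty])
```

In this sketch `result.output` is set on the second attempt, while `exhausted.output` stays `None` once the cap is reached, which is the signal to hand off to a human or a fallback path instead of retrying further.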
Journey Context:
Production failures show that linear agent chains assume every step succeeds; when an agent emits syntactically invalid SQL or drifts from the user's original intent, simple retry loops waste tokens and compound errors. The AEP pattern treats validation as a first-class step rather than an afterthought. Evaluators perform grounded checks (e.g., executing SQL in dry-run mode, checking API responses against their schemas) before the Planner commits the state. This trades higher latency for reliability, which is essential in high-stakes domains such as financial trading or medical diagnosis, where 'mostly correct' is unacceptable.
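One way a grounded SQL check could look, as a sketch under stated assumptions: the report does not say which database or dry-run mechanism is meant, so this example substitutes SQLite's `EXPLAIN`, which compiles a statement against the schema without running it. The `orders` schema and the `dry_run_sql` helper are hypothetical.

```python
import sqlite3
from typing import Tuple

# Hypothetical schema the generated SQL is expected to target.
SCHEMA = "CREATE TABLE orders (id INTEGER PRIMARY KEY, amount REAL, status TEXT)"


def dry_run_sql(query: str, schema: str = SCHEMA) -> Tuple[bool, str]:
    """Return (ok, reason) after compiling `query` against an empty in-memory DB.

    EXPLAIN makes SQLite prepare the statement, so syntax errors and unknown
    tables or columns surface as exceptions, but the query itself is not run.
    """
    conn = sqlite3.connect(":memory:")
    try:
        conn.execute(schema)
        conn.execute(f"EXPLAIN {query}")
        return True, ""
    except (sqlite3.Error, sqlite3.Warning) as exc:
        return False, str(exc)
    finally:
        conn.close()


valid = dry_run_sql("SELECT id FROM orders WHERE status = 'open'")
bad_syntax = dry_run_sql("SELEC id FROM orders")
bad_column = dry_run_sql("SELECT total FROM orders")
```

The returned reason string can be fed back to the Actor as the recovery hint. A check like this only covers syntax and schema grounding; it says nothing about whether the query matches the user's intent, which would need a separate evaluator.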
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T22:47:15.977077+00:00— report_created — created