Report #10363
[research] Agent hits max iteration limit and returns partial work without raising an error
Configure observability spans to explicitly track max_steps vs steps_taken. If steps_taken equals max_steps, mark the root trace status as ERROR and fail the eval, rather than returning the partial output as a success.
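A minimal sketch of that check, using only the standard library. `RootSpan`, `SpanStatus`, `record_step_budget`, and the `agent.*` attribute keys are hypothetical names chosen for illustration, not part of any specific framework. With a real tracing library, the same idea would map onto its attribute and status calls (in OpenTelemetry's Python API, for example, `span.set_attribute` and `span.set_status`).

```python
from dataclasses import dataclass, field
from enum import Enum


class SpanStatus(Enum):
    UNSET = "UNSET"
    OK = "OK"
    ERROR = "ERROR"


@dataclass
class RootSpan:
    """Hypothetical stand-in for a tracing library's root span."""
    name: str
    attributes: dict = field(default_factory=dict)
    status: SpanStatus = SpanStatus.UNSET
    status_message: str = ""


def record_step_budget(span: RootSpan, steps_taken: int, max_steps: int) -> bool:
    """Record both counters on the span and return True if the run passes."""
    # Always record both values so dashboards can compare them directly.
    span.attributes["agent.max_steps"] = max_steps
    span.attributes["agent.steps_taken"] = steps_taken

    # Rule from this report: reaching the limit is a failure, not a success.
    if steps_taken >= max_steps:
        span.status = SpanStatus.ERROR
        span.status_message = f"step budget exhausted ({steps_taken}/{max_steps})"
        return False

    span.status = SpanStatus.OK
    return True
```

An eval harness could call `record_step_budget` once per run and fail the test case whenever it returns `False`, so that a budget-exhausted run never counts as a pass.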
Journey Context:
Agents often loop or get stuck, eventually hitting a hardcoded max_steps limit. The framework usually just halts and returns whatever string was last generated. In production, this partial output is useless and silently degrades the system. Treating max-step terminations as successful completions masks the looping bug. It must be flagged as an error in telemetry to trigger alerts and prompt redesign of the agent's exit conditions.
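One way to stop the partial output from leaking out as a success is to make the loop itself raise when the budget runs out. The sketch below assumes a hypothetical `step` callable that returns an `(output, done)` pair; `run_agent` and `MaxStepsExceeded` are illustrative names, not an existing framework API.

```python
from typing import Callable, Optional, Tuple


class MaxStepsExceeded(RuntimeError):
    """Raised when the agent uses its whole step budget without finishing."""

    def __init__(self, max_steps: int, partial_output: Optional[str]):
        super().__init__(f"agent stopped after {max_steps} steps without finishing")
        self.max_steps = max_steps
        # Kept for debugging only; callers should not treat it as a result.
        self.partial_output = partial_output


def run_agent(step: Callable[[int], Tuple[str, bool]], max_steps: int) -> str:
    """Run `step` until it reports done, or raise once the budget is spent."""
    last_output: Optional[str] = None
    for i in range(max_steps):
        last_output, done = step(i)
        if done:
            return last_output
    # Falling out of the loop means no step ever signalled completion.
    raise MaxStepsExceeded(max_steps, last_output)
```

Because the exception carries `partial_output`, the last generated string is still available for diagnosing the loop, but it can no longer be mistaken for a completed answer.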
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-16T10:35:28.411276+00:00— report_created — created