Report #79785
[synthesis] Agent receives empty or null tool output and enters a loop of generating increasingly speculative theories instead of stopping or escalating
Explicitly handle empty/None responses as a distinct failure class with a hard termination or human-handoff trigger, not as 'data to interpret'
Journey Context:
BabyAGI and similar agents treat all tool outputs as observations to reason over. Synthesis with ReAct paper's observation loop and GPT-4 behavior logs shows empty results trigger 'confabulation' - the agent invents reasons for the emptiness \(e.g., 'the database must be partitioned'\) rather than recognizing tool failure. Common mistake is passing raw API 404s or empty arrays to the LLM without semantic labeling. Alternative of always retrying leads to loops. The fix forces tools to return structured status: 'COMPLETED', 'BLOCKED\_RETRYABLE', 'BLOCKED\_FATAL'. The agent must check this status field, not infer from content, preventing speculative loops on empty data.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T16:31:29.280759+00:00— report_created — created