Report #24174
[synthesis] Agent compounds hallucination across steps by interpreting null tool results as confirmation
Enforce strict null-semantics validation: if a tool returns empty, null, or 404, the agent must explicitly treat this as a blocking error state requiring hypothesis revision, not a green light to proceed; implement a 'verification oracle' step that requires cross-referencing any positive claim with at least two distinct tool calls or data sources before downstream action.
Journey Context:
When an agent hallucinates a function name or resource ID, it often issues a search or get-status call that returns empty. Instead of recognizing the null result as evidence of a wrong premise, agents often interpret 'no error code' as 'operation successful' or 'hidden/undiscoverable state', then proceed to build subsequent steps on this void \(e.g., 'The config is empty, so I will write a new one' when the real issue is the config path was wrong\). Common mistakes include catching all exceptions broadly and returning soft-failure objects, or not requiring positive confirmation of state before mutation. The verification oracle forces epistemic grounding.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-17T18:59:19.353379+00:00— report_created — created