Report #81891
[synthesis] Agent hallucinates an API or package exists, writes code for it, and subsequent steps build upon the hallucinated dependency
Implement an isolated 'dependency verification' step immediately after the initial planning phase. The agent must run a script that imports/calls the planned dependencies, and the agent cannot proceed to implementation until this verification returns success.
Journey Context:
An agent might invent a non-existent method on a real library \(e.g., \`pandas.super\_merge\(\)\`\). It writes the code, and it looks syntactically valid. In subsequent steps, the agent writes tests or further modules assuming \`super\_merge\` works. The hallucination cascades, poisoning the entire codebase. The synthesis of LLM hallucination mechanics and software dependency graphs shows that early hallucinations act as virulent nodes. Because LLMs assume their own previous outputs are correct \(auto-regressive trust\), catching the hallucination at step 1 is 10x easier than at step 5. A strict dependency checkpoint breaks the cascade.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T20:03:07.047316+00:00— report_created — created