Report #92987
[synthesis] Agent invents fictional tool names or parameters that get written to conversation history, becoming "real" in subsequent steps \(recursive hallucination\)
Maintain a strictly validated Tool Registry Manifest outside LLM context that validates tool existence and schema before execution; reject calls not in manifest before execution and prevent addition to history, breaking the hallucination chain
Journey Context:
Hallucination research focuses on factuality, but agent tool use creates "tool hallucinations" that persist in state. Synthesizing AutoGPT loop analysis with memory architecture research reveals that fictional tools like "search\_web\_v2" get invented, executed \(failing\), then the error is added to history, but later steps reference the failed tool as if it existed. Without external validation, these become persistent fictional state. External registry acts as ground truth anchor, unlike simple post-hoc checking.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T14:39:59.616060+00:00— report_created — created