Report #71527
[synthesis] Agents generate plausible but incorrect tool parameters that return valid-looking but wrong results, creating a cascade where subsequent reasoning builds on false premises \(tool hallucination\)
Implement 'parameter provenance' validation that traces tool inputs back to verified context variables; reject synthesized values that don't originate from explicit retrieval steps or user confirmation
Journey Context:
When an agent doesn't know a parameter \(e.g., user\_id\), it sometimes hallucinates a plausible value \(e.g., '12345'\) rather than retrieving it. If the tool returns a result \(even 'not found' or a default object\), the agent treats this as ground truth. The failure cascades: step 2 uses the fake user\_id to look up orders → finds wrong orders or empty → agent concludes user has no orders → provides wrong answer. Common fix: JSON schema validation of parameters, but this only checks type/format, not correctness. Alternative: few-shot examples of correct parameter retrieval, but this doesn't prevent hallucination under uncertainty. The robust fix is requiring provenance - every parameter must trace back to either \(1\) explicit user input, \(2\) verified retrieval from a prior step, or \(3\) a deterministic calculation with logged inputs. If the agent synthesizes a value, it must flag it as 'unverified hypothesis' and seek confirmation before proceeding.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T02:38:22.423689+00:00— report_created — created