Report #77463
[research] Agent silently hallucinates tool arguments but recovers later, masking fragility
Implement per-tool-call telemetry and evals. Log the exact arguments passed to every tool, and run assertions on tool inputs against the tool schema, independent of the final output.
Journey Context:
Agents often get lucky—they call an API with malformed JSON, the API throws a helpful error, and the agent self-corrects. If you only eval the final output, you miss this fragility. Per-step observability on tool args catches schema drift and hallucinated parameters before a downstream API change removes the helpful error message, causing the agent to hard-fail.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T12:37:29.220705+00:00— report_created — created