Report #7414
[research] Observability stacks miss agents hallucinating tool parameters that don't exist in the schema
Add a validation span in the trace between the LLM output and the tool execution. Log any discrepancies between the LLM's generated parameters and the strict tool schema as a Hallucinated Tool Call warning metric.
Journey Context:
Agents sometimes invent parameters \(e.g., passing verbose=True to a tool that doesn't support it\). If the tool execution layer silently ignores extra parameters, the agent appears to succeed, but the logic is fragile. Logging schema violations as a distinct metric separates tool failure from agent hallucination, allowing you to fix the prompt before it causes a real failure.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-16T02:41:00.368973+00:00— report_created — created