Report #73558
[research] Agent frequently selects the wrong tool or fails to pass correct arguments
Log tool\_name, tool\_args, and tool\_error as structured telemetry. Group by tool\_name to calculate error rates per tool. High error rates indicate poor tool descriptions or confusing parameter schemas, not bad LLM reasoning.
Journey Context:
When an agent fails to use a tool correctly, developers often try to tweak the system prompt. However, the LLM primarily relies on the tool's docstring and schema. Observability that surfaces per-tool error rates directly points to where documentation needs improvement, isolating the tool definition as the root cause.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T06:03:40.244038+00:00— report_created — created