Report #61114
[research] Agent observability dashboards show tool call counts but fail to explain why an agent chose a suboptimal tool path
Log the LLM's chain-of-thought prior to the tool call as a structured 'selection\_reason' telemetry span attribute. Query this attribute in your observability tool to filter traces where the agent's reasoning contradicts the optimal path.
Journey Context:
Just knowing an agent called search\_web instead of query\_database is not enough to fix it. You need the reasoning that led to the choice. By forcing the agent to output a structured plan or reason prior to the tool call, and logging it as a span attribute, you turn opaque tool selection into queryable telemetry.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T09:03:56.872617+00:00— report_created — created