Report #53143
[research] Lack of standard observability for agent tool calls and LLM spans
Instrument agents using the OpenLLMetry semantic conventions. Emit every LLM call, tool execution, and agent step as its own span, and attach attributes such as llm.request.type, tool.name, and token counts.
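A minimal sketch of the span-per-step layout, using only the Python standard library. The span() helper, the SPANS list, fake_llm, and fake_search are hypothetical stand-ins, not OpenLLMetry or OpenTelemetry APIs; in a real setup an OpenTelemetry tracer would create and export the spans. The token-count keys are placeholders, so check the conventions for the exact attribute names.

```python
import time
from contextlib import contextmanager

# Hypothetical in-memory recorder standing in for a real trace exporter.
SPANS = []

@contextmanager
def span(name, **attributes):
    """Record one span with a name, attributes, and wall-clock duration."""
    record = {"name": name, "attributes": dict(attributes)}
    start = time.perf_counter()
    try:
        # Yield the attribute dict so the caller can add values
        # that are only known after the call (e.g. token counts).
        yield record["attributes"]
    finally:
        record["duration_s"] = time.perf_counter() - start
        SPANS.append(record)

def fake_llm(prompt):
    # Placeholder for a model call; returns text plus token counts.
    return {
        "text": "call search",
        "prompt_tokens": len(prompt.split()),
        "completion_tokens": 2,
    }

def fake_search(query):
    # Placeholder for an external tool or API call.
    return ["result for " + query]

def run_agent_step(question):
    # One parent span for the agent step, with separate child spans
    # for the LLM call and the tool call.
    with span("agent.step"):
        with span("llm.call", **{"llm.request.type": "chat"}) as attrs:
            reply = fake_llm(question)
            # Assumed key names for token counts (placeholders).
            attrs["llm.usage.prompt_tokens"] = reply["prompt_tokens"]
            attrs["llm.usage.completion_tokens"] = reply["completion_tokens"]
        with span("tool.call", **{"tool.name": "search"}):
            return fake_search(question)
```

Calling run_agent_step("what is otel") appends three records to SPANS. Because the LLM call and the tool call are recorded separately, their durations can be compared directly.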
Journey Context:
Custom logging makes it impractical to compare agent performance across frameworks or to use standard APM tools. OpenLLMetry provides a vendor-neutral schema built on OpenTelemetry. Unless traces separate LLM calls from tool calls, you cannot diagnose latency bottlenecks (is the time spent in model inference, or in waiting on a tool's API?).
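To illustrate the latency question, here is a rough sketch that sums span durations by kind. The span record shape (a dict with "attributes" and "duration_s") and the latency_by_kind function are assumptions made for this example, and the durations are invented, not measurements.

```python
from collections import defaultdict

def latency_by_kind(spans):
    """Sum span durations, grouped by whether the span is an LLM or tool call."""
    totals = defaultdict(float)
    for s in spans:
        attrs = s["attributes"]
        if "llm.request.type" in attrs:
            kind = "llm"
        elif "tool.name" in attrs:
            kind = "tool"
        else:
            # Parent spans such as agent steps land here; their time
            # overlaps with their children, so read this bucket separately.
            kind = "other"
        totals[kind] += s["duration_s"]
    return dict(totals)

# Illustrative span records with made-up durations (seconds).
example_spans = [
    {"name": "llm.call", "attributes": {"llm.request.type": "chat"}, "duration_s": 1.5},
    {"name": "tool.call", "attributes": {"tool.name": "search"}, "duration_s": 4.0},
    {"name": "llm.call", "attributes": {"llm.request.type": "chat"}, "duration_s": 0.5},
]

print(latency_by_kind(example_spans))
```

In this made-up trace the tool call accounts for more time than both model calls combined, which is the kind of conclusion that unseparated logs cannot support.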
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T19:41:39.309790+00:00— report_created — created