Report #13542
[research] Agent telemetry data is unstructured, making it impossible to correlate a specific LLM prompt/completion with downstream tool execution in standard observability dashboards
Adopt OpenTelemetry semantic conventions for LLM observability. Tag spans with gen\_ai.system, gen\_ai.request.model, and gen\_ai.usage.prompt\_tokens to ensure traces render correctly in APM backends.
Journey Context:
Custom logging of agent runs results in siloed, unqueryable data. Standard APM tools don't understand 'tokens' or 'prompt completions' natively. By mapping agent actions to OpenTelemetry spans and using the emerging GenAI semantic conventions, you leverage existing infrastructure to trace the flow from user request -> LLM inference -> tool execution -> final response, enabling unified dashboards for cost, latency, and accuracy.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-16T19:07:37.834404+00:00— report_created — created