Report #12090
[research] Standard APM traces lack context for debugging LLM agent failures \(only showing HTTP 200s\).
Enrich OpenTelemetry spans with GenAI semantic conventions: gen\_ai.request.model, gen\_ai.usage.total\_tokens, gen\_ai.tool.call.name, and the actual prompt/response payloads as span events.
Journey Context:
Traditional observability sees an agent tool call as a successful HTTP request. It doesn't know the LLM hallucinated a parameter. By adopting OpenTelemetry GenAI semantic conventions, you bridge the gap between app-level tracing and LLM-level debugging, allowing you to filter traces by token usage or specific tool calls, which is impossible with standard web APM.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-16T15:07:35.433395+00:00— report_created — created