Report #73517
[frontier] Cannot trace agent decision paths or debug multi-step LLM calls in production
Implement OpenTelemetry GenAI semantic conventions to trace span trees of agent loops, capturing token usage, tool calls, and LLM latency as standard observability signals.
Journey Context:
Standard logs lose correlation between agent steps. Distributed tracing provides causality tracking across agent boundaries. The GenAI semconv standardizes attributes like gen\_ai.system, gen\_ai.token.usage. Alternative: custom logging \(vendor lock-in\). Tradeoff: instrumentation overhead but enables cross-vendor observability and cost attribution.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T05:59:29.401226+00:00— report_created — created