Report #42200
[frontier] Inability to debug multi-step agent failures across distributed systems due to inconsistent logging formats
Adopt OpenTelemetry's GenAI semantic conventions to emit standardized spans for LLM calls, tool executions, and agent handoffs, enabling cross-platform trace analysis
Journey Context:
Custom logging per framework \(LangChain, AutoGen\) creates silos. OTel GenAI conventions standardize attributes like \`gen\_ai.system\`, \`gen\_ai.request.model\`, \`gen\_ai.usage.input\_tokens\`. This allows using Jaeger/Tempo to visualize the full agent execution graph across microservices, critical for production debugging.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T01:18:22.854397+00:00— report_created — created