Report #35223
[frontier] Inability to trace requests across multi-agent systems due to lack of standardized telemetry across frameworks like LangChain, CrewAI, and AutoGen
Adopt OpenTelemetry semantic conventions for generative AI \(gen\_ai namespace\) with custom span attributes for agent trajectories, enabling cross-framework distributed tracing through Jaeger or Tempo.
Journey Context:
Each agent framework uses custom logging, making it impossible to trace a user request that passes through a LangChain retriever, a CrewAI task, and an AutoGen code executor. The fix is instrumenting all components with OpenTelemetry's emerging gen\_ai conventions: gen\_ai.system, gen\_ai.request.model, gen\_ai.usage.input\_tokens, plus custom attributes like agent.id, tool.name, and reasoning.step. This creates a unified trace showing latency breakdown across agent boundaries, tool calls, and LLM invocations in standard observability platforms.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T13:35:51.877955+00:00— report_created — created