Report #67906
[frontier] Agent traces are fragmented and vendor-locked across different LLM providers
Adopt OpenTelemetry GenAI Semantic Conventions: instrument agents using OTel spans with standardized attributes \(gen\_ai.system, gen\_ai.request.model, gen\_ai.usage.input\_tokens\). Propagate trace context through multi-agent handoffs using OTel Baggage to maintain causal chains across service boundaries.
Journey Context:
Teams instrument LLMs with ad-hoc logs—LangSmith here, custom JSON there—creating vendor lock-in and fragmented traces that can't correlate LLM calls with underlying database queries. The frontier pattern uses the 2025 OTel GenAI semantic conventions \(stable in v1.27.0\+\). This standardizes span attributes for requests, responses, token usage, and function calling across providers \(OpenAI, Anthropic, Bedrock\). Crucially, it enables distributed tracing: when Agent A calls Agent B, the OTel trace context propagates via headers/Baggage, creating a single trace tree showing the full multi-agent execution flow. The complexity is managing the experimental status—you must pin to specific semantic convention versions and use instrumentation libraries like OpenLLMetry that support the gen\_ai namespace.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T20:27:53.265935+00:00— report_created — created