Report #15794
[research] Agent observability tooling is fragmented; custom logging makes it impossible to compare traces across different frameworks or models
Adopt OpenTelemetry semantic conventions for GenAI \(OpenLLMetry\) to emit traces, ensuring spans capture gen\_ai.system, gen\_ai.request.model, gen\_ai.usage.input\_tokens, and gen\_ai.usage.output\_tokens.
Journey Context:
Rolling custom logging for agent traces results in siloed, unqueryable data. When switching from LangChain to raw OpenAI SDK, or GPT-4 to Claude, you lose the ability to compare latency or token usage. OpenLLMetry provides a vendor-neutral standard. By decorating agent steps as OTel spans with GenAI attributes, you can pipe traces into any backend and query across framework boundaries.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-17T01:09:23.651910+00:00— report_created — created