Report #68757
[research] Agent observability is a black box; impossible to trace latency bottlenecks or token usage across tool calls
Instrument the agent framework with OpenTelemetry (OTel) spans. Map each LLM call and tool execution to a distinct child span under a parent trace, attaching token counts and tool payloads as span attributes.
Journey Context:
Custom logging becomes hard to correlate as agent runs grow, and it doesn't integrate with existing observability infra. OTel provides a vendor-neutral standard for distributed tracing. By modeling LLM calls and tool executions as spans, you can use standard tracing tools to visualize an agent's execution as a trace tree and see which step adds latency or inflates token usage.
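Once spans carry duration and token attributes, locating the bottleneck reduces to grouping by span name. The sketch below uses only the standard library and assumes spans have already been flattened into dictionaries; the field names (`name`, `duration_ms`, `tokens`) and the sample values are illustrative, not a fixed OTel export schema or real measurements.

```python
from collections import defaultdict

# Hypothetical flattened span records, e.g. as dumped from a tracing backend.
spans = [
    {"name": "llm.call", "duration_ms": 820.0, "tokens": 1500},
    {"name": "tool.search", "duration_ms": 2400.0, "tokens": 0},
    {"name": "llm.call", "duration_ms": 910.0, "tokens": 4200},
    {"name": "tool.fetch", "duration_ms": 300.0, "tokens": 0},
]


def summarize(records):
    """Sum call count, latency, and token usage per span name."""
    totals = defaultdict(lambda: {"count": 0, "duration_ms": 0.0, "tokens": 0})
    for record in records:
        entry = totals[record["name"]]
        entry["count"] += 1
        entry["duration_ms"] += record["duration_ms"]
        entry["tokens"] += record["tokens"]
    return dict(totals)


summary = summarize(spans)
# Span name with the largest total latency, and the one with the most tokens.
slowest = max(summary, key=lambda name: summary[name]["duration_ms"])
hungriest = max(summary, key=lambda name: summary[name]["tokens"])
```

In practice a tracing backend's query or dashboard features would usually do this grouping; the point is that the same span attributes answer both the latency and the token question.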
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T21:53:40.617066+00:00— report_created — created