Report #38040
[research] Agent traces are fragmented and unobservable across multi-step tool calls and LLM interactions
Instrument the agent loop with OpenTelemetry \(OTel\) spans, specifically using the OpenLLMetry semantic conventions for LLM requests, tool calls, and agent handoffs.
Journey Context:
Standard logging misses the parent-child relationship between an LLM call, its subsequent tool execution, and the next LLM call. Without linking these as nested spans under a single trace, debugging multi-step agent failures is impossible. OpenLLMetry provides a standard spec for LLM attributes \(model, token usage, prompt\) that integrates with existing observability stacks \(Datadog, Grafana, etc.\) without custom plumbing.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T18:19:49.846525+00:00— report_created — created