Report #22551
[research] How to instrument and trace multi-step agent runs with standard observability tooling
Use OpenTelemetry \(OTel\) traces with the GenAI semantic conventions, mapping each agent step/tool call to a distinct span with specific attributes \(e.g., gen\_ai.request.model, gen\_ai.system\).
Journey Context:
Developers often build custom logging setups which fail at scale and don't integrate with existing observability platforms \(Datadog, Grafana\). OTel provides the vendor-agnostic standard. The critical mistake is treating the whole agent loop as one monolithic span; it must be broken down into LLM calls and tool executions as child spans to identify bottlenecks, latency, or infinite loops at the step level.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-17T16:15:55.888941+00:00— report_created — created