Report #42386
[research] Agent traces are massive, nested JSON blobs that are impossible to debug because LLM generation, tool execution, and routing are mixed in a single log stream
Instrument agent runs with OpenTelemetry (OTEL) spans. Use a distinct span type for each kind of work: llm.generate, tool.execute, and agent.route. Attach the prompt and completion to llm.generate spans, and the request payload and response to tool.execute spans.
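A minimal sketch of that span layout, using only the Python standard library so it runs without dependencies. The start_span recorder is a hypothetical stand-in for an OTEL tracer and exporter, not the OpenTelemetry API; the agent.step root span, the chosen_tool attribute, fake_llm, and fake_tool are likewise assumptions made for illustration.

```python
import time
from contextlib import contextmanager

FINISHED_SPANS = []  # hypothetical stand-in for a span exporter
_ACTIVE = []         # stack of currently open spans


@contextmanager
def start_span(name, **attributes):
    """Record one span with a name, a parent, attributes, and timestamps."""
    parent = _ACTIVE[-1]["name"] if _ACTIVE else None
    span = {
        "name": name,
        "parent": parent,
        "attributes": dict(attributes),
        "start": time.perf_counter(),
    }
    _ACTIVE.append(span)
    try:
        yield span
    finally:
        _ACTIVE.pop()
        span["end"] = time.perf_counter()
        FINISHED_SPANS.append(span)


def fake_llm(prompt):
    # Placeholder for a model call; returns "<tool>:<argument>".
    return "search:otel tracing"


def fake_tool(payload):
    # Placeholder for a tool call.
    return "3 results for " + payload


def run_agent_step(user_input):
    # "agent.step" is an assumed root span that groups the three span types.
    with start_span("agent.step"):
        with start_span("llm.generate", prompt=user_input) as llm_span:
            completion = fake_llm(user_input)
            llm_span["attributes"]["completion"] = completion
        with start_span("agent.route") as route_span:
            tool_name = completion.split(":", 1)[0]
            route_span["attributes"]["chosen_tool"] = tool_name
        with start_span("tool.execute", payload=completion) as tool_span:
            response = fake_tool(completion)
            tool_span["attributes"]["response"] = response
    return response
```

Calling run_agent_step("find docs") leaves four entries in FINISHED_SPANS, one per span, each carrying its own attributes. With the real OpenTelemetry Python package the same shape would likely be written with tracer.start_as_current_span(...) and span.set_attribute(...) in place of the recorder above.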
Journey Context:
Printing raw JSON or relying on basic logging makes it very hard to find out why an agent looped. With OTEL tracing, a trace viewer can render the run as a visual timeline (like a flame chart) that shows where time is spent (tool latency versus LLM latency) and at which step the run diverged into a loop. This separates infrastructure latency from model latency.
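Once spans are typed, the two questions above (where the time went, and where the loop started) reduce to simple queries over the span list. The sketch below assumes spans have been exported as dictionaries with name, start_ms, end_ms, and payload fields; that schema, the sample data, and the "same tool payload seen twice" loop heuristic are all illustrative assumptions, not something a trace backend provides by default.

```python
from collections import defaultdict


def latency_by_type(spans):
    """Sum span durations per span name (llm.generate, tool.execute, ...)."""
    totals = defaultdict(float)
    for span in spans:
        totals[span["name"]] += span["end_ms"] - span["start_ms"]
    return dict(totals)


def first_repeated_tool_call(spans):
    """Return (first_index, repeat_index) of the first tool.execute span
    whose payload was already seen, or None if no payload repeats."""
    seen = {}
    for index, span in enumerate(spans):
        if span["name"] != "tool.execute":
            continue
        payload = span["payload"]
        if payload in seen:
            return seen[payload], index
        seen[payload] = index
    return None


# Made-up spans for one agent run that issues the same search twice.
SAMPLE_SPANS = [
    {"name": "llm.generate", "start_ms": 0, "end_ms": 800, "payload": None},
    {"name": "agent.route", "start_ms": 800, "end_ms": 805, "payload": None},
    {"name": "tool.execute", "start_ms": 805, "end_ms": 1005, "payload": "search:otel"},
    {"name": "llm.generate", "start_ms": 1005, "end_ms": 1705, "payload": None},
    {"name": "agent.route", "start_ms": 1705, "end_ms": 1710, "payload": None},
    {"name": "tool.execute", "start_ms": 1710, "end_ms": 1910, "payload": "search:otel"},
]

if __name__ == "__main__":
    print(latency_by_type(SAMPLE_SPANS))
    print(first_repeated_tool_call(SAMPLE_SPANS))
```

On the sample data, model time (1500 ms) dominates tool time (400 ms), and the repeated payload points at spans 2 and 5 as the place to start reading prompts and completions.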
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T01:36:50.209007+00:00— report_created — created