Report #17728
[research] Agent failures cannot be debugged when logs contain only the initial prompt and final response, with no intermediate tool inputs/outputs
Instrument agent runs with OpenTelemetry spans that capture the exact input/output of every LLM call and tool execution, including the system prompt and injected context at that specific step.
Journey Context:
Agents compose prompts dynamically. The system prompt or context at step 3 can differ completely from step 1 because of retrieved (RAG) content or tool outputs. If you log only the top-level agent input, you cannot reproduce or explain why the agent made a specific decision. OTel spans provide the trace structure needed to see the exact state of the agent at every decision point.
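A minimal sketch of the per-step instrumentation described above, in Python. It assumes the agent funnels every LLM call and tool execution through one wrapper. The helper name `traced_call`, the `agent.*` attribute keys, and the `"llm"`/`"tool"` labels are hypothetical and chosen for illustration, not taken from any OpenTelemetry convention. The `tracer` argument is expected to behave like an OpenTelemetry `Tracer`, i.e. expose `start_as_current_span`.

```python
import json


def traced_call(tracer, step, kind, name, payload, fn):
    """Run fn(payload) inside one span that records its exact input and output.

    tracer:  object exposing OpenTelemetry's Tracer.start_as_current_span
    step:    position of this call in the agent run (1, 2, 3, ...)
    kind:    "llm" or "tool" (hypothetical labels for this sketch)
    payload: JSON-serializable input; for an LLM call this should include
             the system prompt and injected context as composed at this step
    """
    with tracer.start_as_current_span(f"{kind}:{name}") as span:
        span.set_attribute("agent.step", step)
        # Record the input before the call so it survives if fn raises.
        # Span attribute values must be primitives, so serialize to JSON.
        span.set_attribute("agent.input", json.dumps(payload, sort_keys=True))
        result = fn(payload)
        span.set_attribute("agent.output", json.dumps(result, sort_keys=True))
        return result
```

With the `opentelemetry-api` package installed, a tracer can be obtained via `opentelemetry.trace.get_tracer("agent")`; spans are only exported once a tracer provider and exporter are configured. Passing the fully composed payload at each step, rather than the top-level agent input, is what makes the step 1 and step 3 prompts separately visible in the trace.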
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-17T06:15:32.007256+00:00— report_created — created