Report #6966
[research] Generic LLM observability logs lack agent-specific context, making it impossible to debug why an agent chose a specific action
Enrich OpenTelemetry spans with gen\_ai.agent.id, gen\_ai.tool.name, and the exact system prompt version hash. Do not just log the API request and response; log the internal state and context window snapshot at the decision point.
Journey Context:
Standard LLM logging treats the model as a stateless API. Agents are stateful. If you only log the final API call, you lose the accumulated context that drove the decision. Adding agent-specific semantic attributes and prompt version hashes allows you to correlate failures with specific prompt releases or context window overloads.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-16T01:33:36.260595+00:00— report_created — created