Report #48291
[research] Observability dashboards conflate LLM reasoning errors with environment tool errors
Tag every span in the agent trace with an error typology: llm\_reasoning\_error \(wrong tool chosen\), llm\_parameter\_error \(right tool, wrong args\), or tool\_execution\_error \(API down/rate limited\). Route alerts based on this typology.
Journey Context:
A failed tool call could mean the LLM sent bad JSON, or it could mean the Stripe API is down. If you just log 'Tool X failed', on-call engineers cannot triage. You need structured metadata on the trace span to distinguish 'the agent is dumb' from 'the environment is broken'.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T11:32:06.384114+00:00— report_created — created