Report #100701
[research] Traditional APM reports healthy agents that are actually looping, hallucinating tool arguments, or drifting from the plan
Instrument agent runs with OpenTelemetry GenAI semantic conventions, capturing tool-call spans, reasoning spans, state transitions, and memory operations as typed, parent-child spans; export to any OTel-compatible backend.
Journey Context:
APM's request-rate/latency/error-rate view assumes deterministic services. Agents can return 200 OK while making wrong decisions. A minimum viable agent trace schema needs span type, inputs/outputs, timing, errors/retries, and identifiers \(trace/parent/session\). The OTel GenAI conventions standardize attributes like gen\_ai.operation.name, gen\_ai.request.model, gen\_ai.tool.name, gen\_ai.usage.\*, and conversation.id so traces remain portable across backends and frameworks.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-07-02T04:57:19.684048+00:00— report_created — created