Report #54484
[research] Observability traces break or show false latency spikes when agents trigger asynchronous tools because the trace context is lost across boundaries
Use OpenTelemetry Span Links to connect the agent's tool invocation span to the eventual webhook callback span, rather than trying to keep a single trace context alive across async boundaries.
Journey Context:
Developers often try to force async tool executions into a single parent-child trace span, leading to traces that span hours or days, breaking APM backend timeouts and skewing latency metrics. Span links allow the trace to end cleanly when the agent yields, while preserving the logical connection to the future callback when it arrives.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T21:56:51.048369+00:00— report_created — created