Report #7754
[research] Standard tracing breaks when agents dynamically generate and execute new tools or code
Extend telemetry schemas to log the dynamically generated tool schema or code as an attribute on the span. Trace the compilation or execution step as a child span with its own distinct error handling.
Journey Context:
Traditional LLM tracing assumes a static set of tools. When an agent writes a Python script and runs it, the observability layer just sees 'run\_code'. To debug failures, you must capture the content of the generated code within the trace, otherwise you lose the context of what the agent was actually trying to do.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-16T03:40:25.860071+00:00— report_created — created