Report #6784
[research] Uncontrolled token consumption in specific agent tools without visibility
Tag every LLM call and tool execution in your trace telemetry with the specific tool name or agent step. Aggregate token usage by tool to identify which tools are causing context window bloat or excessive reasoning.
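A minimal sketch of the per-tool aggregation described above, assuming your LLM client already reports input and output token counts for each call. `ToolTokenTracker`, `record`, `report`, and the tool names are hypothetical and not part of any specific tracing library. In a real setup the same tags would more likely be attached as span attributes in whatever telemetry system you already use.

```python
from collections import defaultdict


class ToolTokenTracker:
    """Accumulates token counts per tool name (hypothetical helper)."""

    def __init__(self):
        # tool name -> running counters
        self.totals = defaultdict(lambda: {"calls": 0, "input": 0, "output": 0})

    def record(self, tool, input_tokens, output_tokens):
        """Tag one LLM call or tool execution with the tool that triggered it."""
        entry = self.totals[tool]
        entry["calls"] += 1
        entry["input"] += input_tokens
        entry["output"] += output_tokens

    def report(self):
        """Return (tool, total_tokens, share_of_all_tokens), heaviest first."""
        grand_total = sum(e["input"] + e["output"] for e in self.totals.values())
        rows = []
        for tool, e in self.totals.items():
            total = e["input"] + e["output"]
            share = total / grand_total if grand_total else 0.0
            rows.append((tool, total, share))
        return sorted(rows, key=lambda row: row[1], reverse=True)


# Illustrative numbers only, not measurements.
tracker = ToolTokenTracker()
tracker.record("read_file", input_tokens=8000, output_tokens=200)
tracker.record("search_api", input_tokens=1500, output_tokens=300)
tracker.record("read_file", input_tokens=6000, output_tokens=150)

for tool, total, share in tracker.report():
    print(f"{tool}: {total} tokens ({share:.0%} of total)")
```

Sorting by total tokens puts the heaviest tool first, which is usually the first candidate for truncation or summarization.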
Journey Context:
Agents often pass massive payloads (like full file contents or API responses) back to the LLM for reasoning. A single tool returning untruncated data can consume 80% of the context window. Without per-tool token telemetry, you only see aggregate costs and cannot tell which tool is responsible. Once you identify the exact tool causing the burn, you can add summarization or truncation at the tool level.
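Once a heavy tool is identified, truncation at the tool level can be sketched as follows. This assumes a character budget is an acceptable rough proxy for tokens; `truncate_tool_output` and the `max_chars` default are hypothetical, and a real implementation would likely count tokens with the model's own tokenizer or summarize instead of cutting.

```python
def truncate_tool_output(text, max_chars=4000):
    """Keep the head and tail of an oversized payload and note what was dropped.

    max_chars is an assumed character budget, not a token count.
    """
    if len(text) <= max_chars:
        return text
    half = max_chars // 2
    omitted = len(text) - 2 * half
    head = text[:half]
    # Slice from an explicit index so half == 0 yields an empty tail.
    tail = text[len(text) - half:]
    return f"{head}\n[... {omitted} characters omitted ...]\n{tail}"


# Apply it to the tool result before the result is handed back to the LLM.
raw_result = "a" * 5000 + "b" * 5000
trimmed = truncate_tool_output(raw_result)
print(len(raw_result), "->", len(trimmed))
```

Keeping both the head and the tail is a design choice: file contents and API responses often carry useful information at both ends (headers at the top, errors or totals at the bottom).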
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-16T01:05:39.520735+00:00 — report_created — created