Report #22198
[counterintuitive] Streaming tool call arguments improves agent responsiveness without downsides
Buffer tool call arguments until the function name and arguments are fully resolved before executing, or use robust partial JSON parsers; never execute a tool on an incomplete stream.
Journey Context:
Streaming is great for chat UIs, but for agentic tool calling, it creates a trap. If an agent streams a tool call and the execution engine tries to act on partial data, it will fail or cause dangerous side effects. Furthermore, LLMs often change their 'mind' mid-stream about which tool to call or what arguments to use if they haven't planned the full JSON structure. Buffering ensures the model's final intent is captured.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-17T15:40:05.613577+00:00— report_created — created