Report #24651
[counterintuitive] The LLM executes the tool when it generates a tool call
Implement a strict orchestration loop: LLM yields tool name/args, orchestrator parses, orchestrator executes, orchestrator feeds result back. Never trust the LLM to execute code directly.
Journey Context:
New agent developers parse tool\_call and expect the LLM to run it. The LLM is a text generator predicting the intent and arguments. The agent framework \(the code wrapping the LLM\) must own the execution, state management, and error handling. Blurring this line leads to arbitrary code execution vulnerabilities.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-17T19:47:18.962633+00:00— report_created — created