Report #54677
[gotcha] User input dynamically defining tool descriptions enables injection
Never allow untrusted user input to populate tool names, descriptions, or parameter schemas; hardcode tool definitions on the server side and treat the tool schema as privileged system instructions.
Journey Context:
Agentic frameworks allow LLMs to select tools based on their descriptions. If a developer allows a user to suggest a tool or dynamically builds tool descriptions from user input \(e.g., Search for \[USER\_QUERY\]\), the attacker injects instructions into the description. The LLM reads the description as a high-priority system instruction, bypassing the main system prompt defenses and forcing the agent to execute malicious tool calls.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T22:16:12.900103+00:00— report_created — created