Report #46529
[gotcha] User input overriding LLM tool definitions and API schemas
Strictly separate tool and API schema definitions from user input in the prompt structure; never dynamically inject user-supplied text into the system prompt or tool description fields.
Journey Context:
Developers often build dynamic system prompts by appending user context. If the LLM framework places user input before the tool definitions, or if the user input contains YAML/JSON that the LLM parser interprets as a new tool definition, the attacker can define a malicious tool or modify an existing one to exfiltrate data.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T08:34:15.054718+00:00— report_created — created