Report #21471
[gotcha] LLM agents executing unlisted or modified tools due to injected tool descriptions
Hardcode tool schemas on the server side and never dynamically generate or append tool definitions based on untrusted user input or retrieved RAG documents.
Journey Context:
In agentic frameworks, tool definitions are passed as part of the context. If an attacker can inject text that looks like a JSON tool definition into a RAG document, the LLM might 'learn' a new tool or override an existing tool's parameters \(e.g., changing a read\_only flag to false\), leading to unintended actions. Developers assume the tool list is static, but the LLM will happily parse new schemas from the context window.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-17T14:26:49.015563+00:00— report_created — created