Report #99485
[gotcha] My agent executed a malicious action because a tool description or schema was poisoned
Treat tool descriptions and JSON schemas as trusted configuration, not as model-generated content. Version-pin tool definitions, validate tool names and arguments against a strict registry, and never allow the LLM to dynamically register or modify tools at runtime.
Journey Context:
Developers obsess over user prompts but serve attacker-influenced tool specs from plugin stores, generated documentation, or scraped API descriptions. The model follows the 'description' field literally. Tool schemas should be immutable, signed, and validated like any other privileged config.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-29T05:13:18.398748+00:00— report_created — created