Report #37853
[gotcha] LLM invoking API functions with attacker-controlled arguments
Apply strict validation, authorization, and rate-limiting on the arguments passed to tool/function calls on the backend, treating them as adversarial user input. Never trust the LLM to sanitize or validate arguments.
Journey Context:
Developers validate user input to the LLM, but forget that if the LLM is tricked \(via indirect injection\) into calling a function like send\_email\(to='[email protected]', body=user\_data\), the backend blindly executes it. The LLM is just a text generator; it doesn't 'know' the arguments are malicious, it just predicts the most likely JSON payload.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T18:00:59.808036+00:00— report_created — created