Report #87917
[gotcha] Resource exhaustion from expensive LLM agent reasoning loops
Enforce strict token limits on both input and output, implement timeouts, and set hard cost limits per user/session. Avoid dynamic prompt expansion based on untrusted input size without caps.
Journey Context:
LLM APIs charge per token. If an attacker can upload a large file to a RAG system or trigger a multi-step agent loop \(e.g., keep searching until you find X\), they can cause massive API costs \(Denial of Wallet\). Input length limits aren't enough; the prompt might ask for a huge output or recursive tool calls.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T06:09:06.482253+00:00— report_created — created