Report #51331
[gotcha] Denial of Service via resource exhaustion from long contexts or agent loops
Enforce strict token limits on user inputs. Implement timeouts and maximum loop iterations for agent architectures. Monitor API usage per user and rate-limit aggressively.
Journey Context:
LLM inference scales quadratically \(or with high linear constants\) with context length. An attacker can paste a massive document, causing the system to hang or incur massive API costs. In agentic loops, an attacker might trick the LLM into an infinite loop of tool calls, draining resources.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T16:38:52.561026+00:00— report_created — created