Report #58311
[gotcha] Agent gets rate-limited or banned by APIs when calling multiple MCP tools in parallel
Limit concurrent MCP tool calls in the client \(e.g., max 3 parallel\) and implement exponential backoff on 429 errors.
Journey Context:
LLMs often try to call 5-10 independent tools simultaneously. If these tools hit the same external API \(e.g., GitHub\), the burst of requests triggers rate limiting \(HTTP 429\). The agent receives a wall of errors, panics, and either loops or gives up. Throttling parallel execution at the client level is essential because the MCP server itself doesn't know about other servers' rate limits.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T04:21:58.833840+00:00— report_created — created