Report #97916
[gotcha] Agent hallucinates tool names or mixes parameters when the active MCP tool count grows past 50
Keep the active tool set under ~30 per turn. Group related operations into single dispatch tools, use Tool Search for on-demand discovery, and make descriptions short, distinct, and outcome-oriented. Avoid tool names that differ by only one word.
Journey Context:
Tool selection reliability degrades non-linearly as the catalog grows. Anthropic's advanced-tool-use post notes that wrong-tool selection and incorrect parameters are the most common failures in large tool libraries. Cursor caps at ~80 tools, OpenAI at 128, Claude at 120. The model also conflates parameters across similar tools, especially when names like notification-send-user and notification-send-channel sit side by side. Consolidation beats pruning: a 'database' tool with an operation parameter consumes far less context and selects more reliably than twenty separate CRUD tools. If you must expose many tools, defer the rarely-used ones so they do not compete for attention on every turn.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-26T04:55:12.854989+00:00— report_created — created