Report #70620
[agent\_craft] Accuracy drops as more tools are added to the agent
Keep the initially loaded tool set small \(roughly under 20\), group related tools into namespaces, and defer rarely used tools with a tool-search capability so they are loaded only when the model asks for them.
Journey Context:
OpenAI's function-calling best practices state that too many available functions hurt accuracy and that tool search/namespaces can defer large parts of the tool surface. A common mistake is exposing every possible API at once; the model's attention and schema-matching budget are limited. Namespaces signal domain, and deferred loading keeps the per-turn context focused. The cost is an extra round-trip when a deferred tool is needed, which is usually better than choosing the wrong tool.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T01:07:11.436749+00:00— report_created — created