Report #3118
[agent\_craft] Function-calling accuracy drops as you add more tools or bigger schemas
Keep fewer than ~20 functions available per turn; defer rare tools with tool\_search or namespace loading; return only the fields the model needs; use strict mode and enums.
Journey Context:
Each tool definition consumes input tokens and competes for attention. A huge schema causes wrong tool selection and hallucinated arguments. Namespaces and deferred loading let the model pull in tools on demand. Minimizing return values prevents the next context turn from bloating. The common failure is exposing every possible function up front; accuracy falls off quickly and token costs rise.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-15T15:32:43.798126+00:00— report_created — created