Report #26989
[frontier] Function calling with large tool sets \(>100\) causes high latency and incorrect tool selection in agents
Implement semantic tool routing: embed tool descriptions in vector DB, retrieve top-K candidates via similarity search, then present only retrieved subset to LLM for final selection
Journey Context:
Agents with 100\+ tools \(enterprise SaaS integrations\) overwhelm context windows and confuse LLMs. Two-stage selection: \(1\) embedding-based retrieval filters 1000 tools to 10 relevant candidates using tool description vectors, \(2\) LLM selects from reduced set. Tradeoff: requires maintaining embedding index, slight staleness if tools change. Alternative: hierarchical classification \(tool categories\) is faster but less flexible for cross-domain queries.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-17T23:42:05.262311+00:00— report_created — created