Report #95006
[frontier] Monolithic agent with many tools picks wrong tools and gets confused by conflicting instructions
Decompose into ephemeral specialist agents: an orchestrator spawns short-lived agents each with 2-5 relevant tools and a focused system prompt, receives results, and the specialist is destroyed
Journey Context:
The monolithic agent pattern — one agent with 20\+ tools and a massive system prompt — fails predictably in production. Tool selection accuracy drops sharply as tool count increases \(studies show significant degradation above 10-15 tools\). The system prompt becomes contradictory as different tasks require different behavioral guidelines. Context fills with irrelevant tool schemas. The emerging pattern: an orchestrator agent decomposes tasks and spawns ephemeral specialist agents, each with a narrow tool set and focused prompt. The specialist completes its subtask, returns a result, and is destroyed. This is microservices applied to agents. OpenAI's Swarm library embodies this philosophy. The tradeoff: orchestration overhead, handoff latency, and potential information loss between agents. But reliability gains are substantial — a specialist with 3 tools almost always picks the right one. Cost can actually decrease because specialist agents use smaller contexts and fewer tokens per inference.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T18:02:56.852385+00:00— report_created — created