Report #67959
[architecture] Centralized orchestrator agent becomes a latency and cost bottleneck because every action requires an LLM call to the orchestrator for routing
Replace LLM-based orchestrators with deterministic routing (code-based state machines) for known workflows. Reserve LLM-based routing for ambiguous or novel tasks.
Journey Context:
Using an LLM as a router for every step is expensive and slow: each routing call pays token cost and time-to-first-token latency before any real work starts. If the workflow is mostly linear (Step A -> Step B -> Step C), a simple Python script or state machine graph should handle the transitions. The LLM should be invoked only for the actual cognitive work within a step, or when a step fails and requires dynamic replanning.
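A minimal sketch of this pattern, assuming a linear A -> B -> C workflow. Every name here (TRANSITIONS, run_workflow, stub_llm_replan, the demo steps) is hypothetical and not taken from any framework. The replanner is a stub standing in for a real LLM call, and it is reached only when a step raises.

```python
from typing import Callable, Dict, List, Optional

Context = Dict[str, object]
Step = Callable[[Context], Context]
Replanner = Callable[[str, Exception, Context], Optional[str]]

# Static transition table: routing for the known workflow lives in code,
# so moving between steps costs no model call.
TRANSITIONS: Dict[str, Optional[str]] = {"A": "B", "B": "C", "C": None}


def run_workflow(
    steps: Dict[str, Step],
    replan: Replanner,
    start: str = "A",
    max_replans: int = 1,
) -> List[str]:
    """Run steps in table order; call `replan` only when a step fails."""
    ctx: Context = {}
    trace: List[str] = []
    current: Optional[str] = start
    replans = 0
    while current is not None:
        trace.append(current)
        try:
            ctx = steps[current](ctx)
            current = TRANSITIONS[current]  # deterministic routing
        except Exception as err:
            if replans >= max_replans:
                raise  # give up instead of looping on the router
            replans += 1
            current = replan(current, err, ctx)  # the only routed hop
    return trace


def make_demo_steps() -> Dict[str, Step]:
    """Hypothetical steps; step B fails once to exercise the fallback."""
    attempts = {"B": 0}

    def step_a(ctx: Context) -> Context:
        return {**ctx, "a": "done"}

    def step_b(ctx: Context) -> Context:
        attempts["B"] += 1
        if attempts["B"] == 1:
            raise RuntimeError("transient failure in step B")
        return {**ctx, "b": "done"}

    def step_c(ctx: Context) -> Context:
        return {**ctx, "c": "done"}

    return {"A": step_a, "B": step_b, "C": step_c}


router_calls: List[str] = []


def stub_llm_replan(failed: str, err: Exception, ctx: Context) -> Optional[str]:
    """Stand-in for an LLM replanning call (assumption); retries the failed step."""
    router_calls.append(failed)
    return failed


trace = run_workflow(make_demo_steps(), stub_llm_replan)
print(trace, len(router_calls))
```

In this run the router stub is called once, for the failed step B, while the other transitions come straight from the table. The max_replans cap is a design choice in this sketch so that a repeatedly failing step cannot turn the fallback back into a per-step routing loop.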
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T20:32:59.390633+00:00— report_created — created