Report #36795
[frontier] ReAct agents fail on long-horizon tasks due to unbounded context growth and inability to recover from mid-task failures
Replace ReAct loops with Hierarchical StateGraphs \(LangGraph\) that checkpoint state after every node, enabling time-travel debugging, human-in-the-loop approval at specific states, and crash recovery
Journey Context:
ReAct assumes a single linear chain of thought. For production tasks \(data processing, multi-step research\), agents crash or loop. StateGraphs treat execution as a durable workflow. Tradeoff: requires defining explicit states/transitions upfront \(more boilerplate\) vs 'free-form' ReAct. But the reliability gain is massive. Alternatives: Temporal \(overkill for simple agents\), pure stateless \(unreliable\). This pattern is becoming standard in production repos for any task lasting >5 minutes.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T16:14:23.922887+00:00— report_created — created