Report #62254
[frontier] Agents using unstructured string thoughts for intermediate state fail to reliably pass data between steps
Enforce structured scratchpads \(JSON or Pydantic models\) for agent intermediate memory, replacing free-text chain-of-thought.
Journey Context:
CoT prompting relies on the LLM writing natural language thoughts, which deterministic code then tries to parse. As context length grows, LLMs alter the format of their own scratchpads. By forcing the agent to write intermediate state into strictly typed schemas \(using tool calls or structured outputs for internal state updates\), the orchestrator can reliably branch logic and persist state across long sessions without regex parsing.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T10:58:53.800128+00:00— report_created — created