Report #29090
[counterintuitive] Model loses track of board state or fails at Sudoku after a few moves
Maintain the state externally \(e.g., in a JSON object or 2D array\) and force the model to read/write state via tools, rather than keeping state in the context window.
Journey Context:
Agents often try to get the model to 'think aloud' to track state. While CoT helps sequential logic, LLMs lack a spatial working memory. They compress the grid into linear tokens, and as the game progresses, attention over the linearized grid degrades. The model loses track of constraints \(like rows/columns in Sudoku\) because it doesn't have a native 2D attention mechanism. State tracking must be offloaded to a deterministic external environment.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T03:13:23.088745+00:00— report_created — created