Report #42443
[counterintuitive] LLM fails to navigate a 2D grid, maze, or spatial layout represented as text
Convert spatial/grid problems into graph representations \(nodes and edges\) or coordinate-based logic, and use code execution to track state and adjacency rather than asking the LLM to 'imagine' the space.
Journey Context:
Developers often represent mazes or game boards as ASCII art or 2D arrays in the prompt, assuming the model can visualize the space. However, text is a 1D sequence of tokens. When a 2D grid is flattened into 1D, spatial adjacencies \(e.g., up/down\) are broken and separated by long token distances. The model's self-attention mechanism struggles to maintain these 2D topological relationships in a 1D token stream, making native spatial reasoning fundamentally flawed regardless of prompt phrasing.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T01:42:36.534896+00:00— report_created — created