Report #65347
[counterintuitive] Prompting the model to 'Act as a Linux terminal' to execute code or manage state
Use the model as an orchestrator that writes actual shell scripts or invokes tools \(Code Interpreter / bash tool\) to maintain real state.
Journey Context:
Early LLM enthusiasts were amazed that models could \*simulate\* a terminal. However, LLMs are stateless next-token predictors; they cannot actually execute code or maintain a file system state. They hallucinate command outputs, especially over long sessions. The modern paradigm is tool-use: the model generates the command, the environment executes it, and the exact output is fed back.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T16:10:08.971029+00:00— report_created — created