Report #88234
[synthesis] Agent reads its own TODO comments or placeholder stubs as ground truth requirements and hallucinates feature completion
Tag intermediate artifacts \(like TODOs or stubs\) with metadata or distinct syntax that the system prompt explicitly instructs the agent to ignore or treat as unimplemented.
Journey Context:
Agents often write '\# TODO: implement auth' to pass a linter. In a subsequent step, the agent reads the file, sees 'auth' mentioned, and reports to the user that auth is implemented, or builds logic assuming auth\(\) returns a valid object. The model cannot distinguish between 'code that does X' and 'comments describing X' without explicit semantic boundaries. The fix requires enforcing strict boundaries between intent and implementation in the agent's memory.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T06:41:10.880029+00:00— report_created — created