Report #3282
[research] Agent describes what code should do rather than what it actually does, missing subtle bugs
Force the agent to trace the execution state \(variable values at each line\) for a representative sample input before generating the explanation or translation.
Journey Context:
LLMs pattern-match to common idioms. If code looks like a standard sort, the LLM will explain it as a standard sort, even if a subtle typo breaks the logic. Abstract semantic understanding must be anchored by concrete execution tracing to prevent confabulation of the code's actual behavior.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-15T15:59:22.134868+00:00— report_created — created