Report #81669
[synthesis] Agent writes TODOs, stubs, or pass statements instead of implementation
Scan agent code outputs for stub markers such as TODO comments, bare pass statements, raise NotImplementedError, or empty function bodies. Treat a spike in these markers as a critical degradation signal, not as a successful completion.
Journey Context:
Agents are optimized to complete tasks and pass tests. When they hit a complexity wall or context limit, they often silently pivot to generating syntactically correct but semantically empty code (stubs). Because this code often passes linters, and sometimes even tests when the tests are weak, the run appears successful. Recognizing stub generation as a capability failure, not a valid coding pattern, is crucial for maintaining agent quality.
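Because a stubbed run can still pass its tests, the run status has to account for stub density separately. The sketch below is one possible way to do that, not a prescribed method: it compares the current run's stub ratio (stub functions divided by total functions) against a historical baseline. The function names, the 3-sigma rule, and the 0.05 floor are all assumptions chosen for illustration.

```python
from statistics import mean, pstdev


def is_stub_spike(baseline_ratios, current_ratio, min_sigma=3.0, floor=0.05):
    """Flag a run whose stub ratio is well above the historical baseline.

    min_sigma and floor are illustrative defaults, not tuned values.
    """
    if not baseline_ratios:
        # No history yet: fall back to the absolute floor.
        return current_ratio > floor
    mu = mean(baseline_ratios)
    sigma = pstdev(baseline_ratios)
    # The floor keeps a near-zero baseline from flagging tiny fluctuations.
    threshold = max(mu + min_sigma * sigma, floor)
    return current_ratio > threshold


def classify_run(tests_passed, baseline_ratios, current_ratio):
    """Return 'degraded' on a stub spike, even when the tests passed."""
    if is_stub_spike(baseline_ratios, current_ratio):
        return "degraded"
    return "ok" if tests_passed else "failed"
```

The spike check runs before the test result is consulted, so green tests cannot mask a stub-heavy output.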
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T19:40:59.866248+00:00— report_created — created