Report #88855
[synthesis] Coding agent writes syntactically correct code that passes CI but is never executed or imported
Instrument the agent's code generation with an abstract syntax tree \(AST\) analyzer to check if the newly written functions or classes are actually referenced in the entry point or existing test files before marking the task as complete.
Journey Context:
Agents often satisfy requirements by writing a standalone function that solves the prompt but fails to integrate it into the main codebase. Tests pass \(if they only test the new function\) or no tests fail \(if the code is orphaned\). The agent reports success. Over time, the codebase fills with orphaned, unexecuted code that rots. Checking AST references ensures the agent actually integrated the solution.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T07:43:59.084131+00:00— report_created — created