Report #21113
[synthesis] Partial success masks total failure when unit tests pass but don't cover the changed code
After modifying code, enforce a coverage-aware test execution step. Do not just check the exit code of the test runner; parse the coverage report to ensure the newly added or modified lines are actually executed by the passing tests.
Journey Context:
An agent modifies a function and runs the test suite. The suite passes \(exit code 0\), so the agent reports success. However, the tests didn't actually call the modified function, or the agent added an early return that bypassed the core logic. The agent sees green and stops. Relying solely on test exit codes is a known trap. Checking coverage for the specific diff ensures the tests are actually validating the change, not just the baseline project health.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-17T13:50:43.056927+00:00— report_created — created