Report #66747
[counterintuitive] AI-generated tests validate code correctness
Provide AI with the specification/requirements separately from the implementation when generating tests, or use mutation testing to validate AI-generated test suites.
Journey Context:
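When an LLM is given only the implementation, it derives the expected values from what the code does, not from what the code is supposed to do. If the implementation has an off-by-one error, the AI will write a test that expects the off-by-one result. The generated suite tends to optimize for coverage metrics, not fault detection. This creates a false sense of security: a suite can reach 100% coverage while catching none of the defects that matter.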
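The failure mode above can be illustrated with a minimal sketch. Everything in it is hypothetical and not taken from the report: the `pages_needed` functions, the three test functions, and the hand-written mutants are invented for illustration. The assumed spec is "return the number of pages needed to show `item_count` items at `page_size` items per page". The mutation check is a hand-rolled stand-in for what a real mutation testing tool automates.

```python
from typing import Callable, List

PagesFn = Callable[[int, int], int]


def pages_needed_buggy(item_count: int, page_size: int) -> int:
    # Hypothetical implementation with an off-by-one error:
    # it returns one page too many when item_count is an exact multiple.
    return item_count // page_size + 1


def pages_needed_fixed(item_count: int, page_size: int) -> int:
    # Ceiling division, matching the assumed spec.
    return (item_count + page_size - 1) // page_size


def implementation_derived_test(fn: PagesFn) -> bool:
    # Expected values copied from what the buggy code returns,
    # which is what a test written from the implementation alone looks like.
    return fn(10, 5) == 3 and fn(7, 5) == 2


def spec_derived_test(fn: PagesFn) -> bool:
    # Expected values worked out from the spec, without reading the code.
    return fn(10, 5) == 2 and fn(7, 5) == 2 and fn(0, 5) == 0


def coverage_only_test(fn: PagesFn) -> bool:
    # Executes the function (full line coverage) but asserts almost nothing.
    return isinstance(fn(10, 5), int)


# Hand-written mutants of pages_needed_fixed, each with one small fault.
MUTANTS: List[PagesFn] = [
    lambda n, s: n // s + 1,    # same fault as the buggy implementation
    lambda n, s: n // s,        # rounds down instead of up
    lambda n, s: (n + s) // s,  # wrong rounding offset
]


def mutation_score(test: Callable[[PagesFn], bool], mutants: List[PagesFn]) -> float:
    # Fraction of mutants the test "kills", i.e. fails on.
    killed = sum(1 for mutant in mutants if not test(mutant))
    return killed / len(mutants)


if __name__ == "__main__":
    print("impl-derived test passes on buggy code:", implementation_derived_test(pages_needed_buggy))
    print("spec-derived test passes on buggy code:", spec_derived_test(pages_needed_buggy))
    print("mutation score, spec-derived test:", mutation_score(spec_derived_test, MUTANTS))
    print("mutation score, coverage-only test:", mutation_score(coverage_only_test, MUTANTS))
```

In this sketch the implementation-derived test passes on the buggy function because it encodes the bug as the expected result, while the spec-derived test rejects it. The coverage-only test runs every line yet kills no mutants, which is the gap between coverage and fault detection that mutation testing is meant to expose.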
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T18:30:52.047750+00:00— report_created — created