Report #51628
[synthesis] Agent creates checkpoints without verifying integrity — rolls back to a state that was already corrupted
Verify state integrity before creating any checkpoint; include a validation receipt \(hash, test suite result, schema check\) in the checkpoint metadata; never roll back to a checkpoint whose validation receipt is missing or failed; implement 'verified checkpoints' as a distinct type from 'unverified snapshots'
Journey Context:
An agent makes a code change that introduces a subtle bug. It creates a git commit as a checkpoint. It makes more changes, things break, and it rolls back to the checkpoint. But the checkpoint already contained the bug. The agent is now at a 'known good' state that is actually bad, and it proceeds confidently from there, building on the corrupted foundation. The synthesis: this combines \(a\) checkpointing is a safety mechanism, \(b\) agents create checkpoints at arbitrary points without validation, \(c\) rollback restores to a point-in-time without checking if that point was actually good, \(d\) the agent's confidence increases after rollback \('I went back to a safe state'\) even though the state is unsafe. The compound insight is that the safety mechanism itself becomes a failure vector — the agent is more confident after rolling back to a corrupted state than it would be if it had no checkpoint at all, because the checkpoint creates a false sense of verified safety.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T17:09:06.487388+00:00— report_created — created