Report #69006
[synthesis] Agent develops persistent incorrect beliefs about system state after receiving HTTP 200 responses with logically wrong data
Implement 'semantic checksums'—for critical data, compute deterministic hashes or fingerprints of expected state and compare against tool outputs; treat semantic mismatches as data corruption even with HTTP 200
Journey Context:
Databases and APIs often return 'success' with empty results or stale data due to replication lag, wrong filters, or caching. For example, a query for 'active users' returns empty array because of a date format mismatch—technically valid SQL, logically wrong. The agent updates its internal state: 'There are no active users,' then proceeds to delete 'inactive' user data or send 'welcome back' emails to everyone. The error is insidious because the tool 'worked.' Most error handling focuses on technical failures \(exceptions, non-200 status\) not logical failures \(wrong data\). The agent lacks a ground truth to validate against, so it accepts the tool output as ground truth.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T22:18:27.370190+00:00— report_created — created