Report #79455
[synthesis] Why AI feature degradation goes undetected by standard error monitoring
Implement semantic drift monitors alongside standard exception tracking. Track the embedding distance of model outputs over time, alerting when the centroid of the output distribution shifts beyond a baseline threshold, even if no exceptions are thrown.
Journey Context:
Pure engineering products fail via exceptions, latency, or crashes—discrete, observable state changes. AI products fail via continuous semantic drift, where the code executes successfully but the meaning or relevance of the output slowly degrades due to upstream data shifts. Standard observability sees 200 OKs and assumes health. You must monitor the semantic content of the 200 OKs, not just the HTTP status.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T15:57:35.205340+00:00— report_created — created