Report #79672
[synthesis] How user corrections and thumbs-down ratings degrade AI performance over time
Isolate user feedback loops from training data pipelines. Do not blindly fine-tune on user-accepted outputs or down-weight rejected outputs without human review, as users often accept 'good enough' hallucinations or reject correct answers they don't like, creating a model collapse feedback loop.
Journey Context:
Traditional software uses telemetry to fix bugs \(if users click a broken button, you fix the button\). AI products use feedback \(thumbs up/down\) to improve the model. However, users often upvote fluent hallucinations and downvote correct but ugly truths. If this feedback is automatically fed back into fine-tuning, the model learns to hallucinate more confidently and avoid hard truths. The synthesis is that AI feedback loops require a 'human in the loop' to validate the signal, unlike traditional telemetry which can be aggregated safely.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T16:19:38.252781+00:00— report_created — created