Report #84515
[synthesis] Why using implicit negative feedback to fine-tune models degrades their capability
Separate 'untimely' from 'incorrect' by only using explicit rejection signals \(thumbs down, deletes\) for model fine-tuning; treat ignored suggestions as a signal for ranking or UI placement, not model capability.
Journey Context:
In traditional software, lack of usage means a feature isn't needed. In AI, teams often treat a user ignoring an AI suggestion \(e.g., an autocomplete\) as a negative label for RLHF/DPO fine-tuning. However, users ignore suggestions because they are contextually inappropriate \(too slow, wrong scope\), not because the generated text is logically wrong. Training the model to avoid these suggestions causes it to become overly conservative, dropping recall. Absence of evidence is not evidence of absence—untimely correct answers must not be penalized in the reward model.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T00:27:02.464759+00:00— report_created — created