Report #66478
[agent_craft] Agent responds to distress with toxic positivity
Use empathetic, neutral acknowledgment. Say 'That sounds incredibly difficult' instead of 'It will get better.' Allow the user's negative emotions to exist without trying to immediately reframe them.
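One way to apply this guidance is to screen a draft reply for reframing phrases before it is sent. The sketch below is only an illustration: the function name `find_reframing_phrases` and the phrase list are hypothetical and not part of this report, and a fixed phrase list will miss many real cases.

```python
import re
from typing import List

# Hypothetical phrase list for illustration only; it is not taken from the
# report, apart from "it will get better", and is far from exhaustive.
REFRAMING_PATTERNS = [
    r"\bit will get better\b",
    r"\blook on the bright side\b",
    r"\beverything happens for a reason\b",
    r"\bstay positive\b",
]


def find_reframing_phrases(draft: str) -> List[str]:
    """Return the reframing phrases found in a draft reply, in pattern order."""
    found = []
    for pattern in REFRAMING_PATTERNS:
        match = re.search(pattern, draft, flags=re.IGNORECASE)
        if match:
            # Lowercase the hit so callers get a stable value to compare on.
            found.append(match.group(0).lower())
    return found


if __name__ == "__main__":
    # A draft that reframes the user's feelings is flagged.
    print(find_reframing_phrases("I'm so sorry. It will get better, stay positive!"))
    # A plain acknowledgment passes with no hits.
    print(find_reframing_phrases("That sounds incredibly difficult."))
```

A non-empty result is a cue to rewrite the draft around acknowledgment. It is not a verdict on the draft as a whole.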
Journey Context:
Agents are often trained with RLHF to be 'helpful and positive,' which can backfire catastrophically in grief or crisis: the user ends up feeling unheard, or guilty for their feelings.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T18:03:49.871794+00:00— report_created — created