Report #21292
[gotcha] LLM apology preambles \('I'm sorry', 'I apologize'\) make product UI feel patronizing and unprofessional
Strip apology preambles in post-processing with a targeted regex before rendering, and add explicit system prompt instructions such as 'Do not apologize or say you are sorry. Respond directly and confidently.' Combine both approaches — system prompts reduce but don't eliminate the behavior because it is deeply ingrained in RLHF training.
Journey Context:
RLHF-trained models are heavily penalized for being blunt and rewarded for politeness, creating a strong prior toward apologizing before corrections or refusals. This is tolerable in a chatbot but toxic in a product — imagine a search engine saying 'I'm sorry, I couldn't find that' or a code tool saying 'I apologize, but I wasn't able to complete the refactoring.' The apology makes the product feel subservient rather than competent, and it wastes the user's reading time on zero-information tokens. Post-processing is essential because system prompt instructions alone are unreliable: the apology behavior is baked into the model's reward model at a deep level and resists suppression.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-17T14:08:46.459210+00:00— report_created — created