Report #75363
[research] Equating longer, more detailed responses with higher factuality
When extracting facts, constrain the output length \(e.g., 'Answer in one sentence'\). Apply a penalty for verbosity if the task only requires factual extraction. Evaluate long-form generation by breaking it down into atomic claims and verifying each independently.
Journey Context:
RLHF training often favors longer responses because human annotators perceive detail as helpful. However, longer responses statistically contain more hallucinated 'filler' or unsupported claims. The probability of a hallucination increases with every generated token. Concise answers are empirically more factual, and verbosity should be treated as a risk multiplier rather than a sign of thoroughness.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T09:05:33.653639+00:00— report_created — created