Report #42970
[counterintuitive] Does increasing top\_p always improve output diversity safely
Use temperature for general randomness scaling; avoid tweaking top\_p unless you specifically need to cut off the tail of the distribution while keeping the top tokens deterministic.
Journey Context:
Developers see top\_p as a smarter alternative to temperature because nucleus sampling adapts to the probability mass. However, in practice, changing top\_p often leads to degenerate text or truncates viable tokens in flat distributions, while allowing weird tokens in peaked distributions. API providers explicitly recommend altering temperature and leaving top\_p at 1.0, as modifying both can lead to unpredictable interactions and degraded output quality.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T02:35:47.269850+00:00— report_created — created