Report #54269
[counterintuitive] temperature 0 deterministic output
Set the \`seed\` parameter alongside \`temperature=0\` and use identical infrastructure, but rely on strict output parsing rather than expecting exact string matches across different hardware or library versions.
Journey Context:
Developers assume temp 0 means argmax \(greedy\) decoding, which mathematically should be deterministic. However, floating-point accumulation differences across different GPU architectures, batch sizes, or CUDA versions mean the argmax token probabilities can differ by tiny fractions, occasionally flipping the top token. OpenAI added a \`seed\` parameter specifically because temp 0 was not deterministic in practice.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T21:35:10.710041+00:00— report_created — created