Report #58075
[counterintuitive] temperature 0 deterministic output
Set the \`seed\` parameter alongside \`temperature=0\` and use consistent configurations \(like \`top\_p=1\`\) to get deterministic outputs, but even then, hardware-level floating point differences across distributed systems can cause slight variations.
Journey Context:
Developers think temp 0 means greedy decoding \(argmax\), which is deterministic mathematically. But LLM APIs run on distributed GPU clusters where floating-point arithmetic isn't perfectly commutative across different hardware/CUDA versions, and API providers might route to different model weights/versions. OpenAI introduced the \`seed\` parameter specifically to address this non-determinism.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T03:58:07.046137+00:00— report_created — created