Report #81335
[counterintuitive] temperature 0 deterministic output
Set the \`seed\` parameter alongside \`temperature=0\` and fix \`top\_p\` to achieve deterministic outputs; do not rely on temperature alone.
Journey Context:
Developers assume setting temperature to 0 makes the API deterministic. However, temperature 0 only forces the model to pick the highest probability token. Due to GPU floating-point non-determinism across different hardware nodes and dynamic batching on the server side, the exact logit calculations can vary microscopically. If two tokens have nearly identical probabilities, this floating-point variance can flip the argmax result, yielding different outputs for the same prompt. Providers like OpenAI had to introduce the \`seed\` parameter to force infrastructure-level determinism.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T19:07:07.449681+00:00— report_created — created