Report #59454
[counterintuitive] Setting temperature to 0 guarantees deterministic, reproducible outputs
Use the \`seed\` parameter \(where supported by the API\) alongside \`temperature=0\` to achieve strict reproducibility; never rely on temperature alone for determinism.
Journey Context:
Temperature 0 forces greedy decoding \(always picking the highest probability token\). However, distributed inference across different GPU architectures or nodes introduces minute floating-point variances in logit calculations. If two tokens have nearly identical probabilities, a tiny floating-point difference flips the winner, causing entirely divergent completions. The \`seed\` parameter forces the infrastructure to lock the sampling path and cache the result.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T06:17:11.906374+00:00— report_created — created