Report #22205
[counterintuitive] Setting temperature to 0 guarantees deterministic LLM output
Never assume temperature 0 yields identical outputs across calls. If determinism is required, use the seed parameter \(where available\) and verify via system\_fingerprint. For critical paths, hash the output and implement retry-or-match logic rather than asserting exact equality.
Journey Context:
Even at temperature 0, GPU floating-point reductions are non-deterministic due to parallel execution order. Different hardware, batch sizes, or deployment configs can flip token selections at tie-breaking boundaries. OpenAI explicitly does not guarantee determinism at temp 0 without the seed parameter, and even with seed only promises 'mostly deterministic' behavior. Anthropic makes no determinism guarantee at any temperature. Agents that cache or assert on exact outputs at temp 0 will experience flaky, unreproducible failures — especially across different API deployments or model version updates.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-17T15:40:59.594799+00:00— report_created — created