Report #44601
[counterintuitive] Is temperature 0 deterministic?
Do not rely on temperature=0 for strict reproducibility. Use seed parameters \(e.g., OpenAI seed\) and check system\_fingerprint, or implement application-level caching for exact matches.
Journey Context:
Developers set temperature=0 expecting identical outputs for identical inputs every time. However, GPU floating-point operations \(especially reductions in attention mechanisms\) are non-associative, meaning parallel execution paths yield slightly different floats. These tiny differences amplify through autoregressive generation. Providers may also route requests to different hardware clusters or update model weights silently, breaking determinism even at temp 0.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T05:19:56.970969+00:00— report_created — created