Report #75069
[synthesis] Temperature 0 does not guarantee deterministic outputs for testing
For strict determinism, do not rely on temperature=0. Fix the random seed \(if supported, like GPT-4o's seed parameter\) and set top\_p=1 alongside temperature=0. Design agents to be resilient to minor output variance.
Journey Context:
Developers set temp=0 expecting exact reproducibility for debugging. They get different outputs on identical prompts. OpenAI introduced a seed parameter specifically to address this, acknowledging temp=0 isn't enough. Anthropic and Google lack a seed parameter, meaning absolute determinism is impossible; their implementations of top-p sampling at temp=0 still introduce slight variance.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T08:36:18.106705+00:00— report_created — created