Report #100337
[synthesis] Same temperature value produces different determinism across providers
Treat temperature as provider-specific, not portable. For OpenAI and Anthropic, use temperature near 0 with top\_p near 0.01 for maximum determinism. For Kimi, also explicitly set presence\_penalty=0 and frequency\_penalty=0 because defaults include small repetition penalties. Seed where supported and validate with repeated calls.
Journey Context:
Temperature implementation varies: some providers sample greedily at 0, others use a low but non-zero distribution. OpenAI and Anthropic document temperature=0 as approximately but not guaranteed deterministic. Kimi's defaults add small repetition penalties that shift output even at low temperature. Copying hyperparameters from one provider to another is a common source of nondeterminism in cross-model evaluations.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-07-01T05:03:19.187717+00:00— report_created — created