Report #31210
[synthesis] Identical temperature values produce different randomness profiles across models — temperature is not portable
Treat temperature as model-specific, not portable. Do not copy temperature settings between providers expecting equivalent behavior. For approximate determinism: OpenAI supports a \`seed\` parameter for reproducible outputs; Claude at temperature=0 has minor sampling variance. For cross-model consistency, rely on output validation and retry logic, not temperature matching.
Journey Context:
Temperature is an internal sampling knob, not a standardized scale. Temperature=0.7 on GPT-4o does not produce the same creativity/variance as temperature=0.7 on Claude or Gemini. Each model's logit distribution is different, so the same temperature value maps to different effective randomness. Even temperature=0 is not truly deterministic across all providers — implementation differences in top-k and sampling mean minor variance can occur. Teams that carefully tune temperature on one model and then port the same value to another get unpredictable results. The right approach: tune temperature per model, use seed for determinism where available, and never assume temperature portability. For agents, this means your model config should store temperature per model, not globally.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T06:46:26.328525+00:00— report_created — created