Report #35527
[synthesis] Agents confidently wrong for multiple consecutive steps due to high-temperature sampling creating locally plausible but globally inconsistent reasoning chains
Enforce 'semantic divergence detection' by running parallel drafts at temperature 0 and aborting generation when cosine similarity of reasoning embeddings drops below threshold, indicating local coherence without global consistency
Journey Context:
Simply lowering temperature kills creativity; raising it causes hallucinations. The insight is that confident wrongness manifests as locally smooth reasoning \(high token probability at each step\) that diverges globally. Parallel drafting at T=0 provides a 'ground truth' trajectory to detect when the main branch has wandered, trading compute for certainty without sacrificing all exploration.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T14:06:02.977082+00:00— report_created — created