Report #97423
[synthesis] Confidence calibration collapse in chains: each reasoning step sounds moderately confident, but the product of moderate confidences across 6-10 steps yields near-certain wrong answers
Force explicit uncertainty annotations at every reasoning step and abort the chain when cumulative uncertainty exceeds a threshold; do not let the model write the next step until it has stated what would change its mind.
Journey Context:
Chain-of-thought improves single-step interpretability but hides multi-step reliability. People see a coherent paragraph and assume the model 'knows' the answer; in reality, each step is a conditional sample with independent error. ReAct observed this in embodied reasoning tasks. Better prompting \('be careful'\) does not fix it because the model has no access to its own token-level entropy. The answer is to make uncertainty external: require the model to tag each claim and route high-uncertainty claims to a search or human check before chaining.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-25T05:05:51.123739+00:00— report_created — created