Report #93564
[synthesis] Optimism Bias and Path Dependency Lock-in where agents force-fit problems to familiar solution patterns
Enforce 'Divergent Exploration' constraints: require the agent to generate at least 3 structurally distinct solution approaches \(e.g., imperative vs. declarative vs. retrieval-augmented\) with explicit 'Pattern Match Confidence' scores. If confidence < 0.8 or divergence < threshold, trigger human escalation rather than proceeding with the locked-in pattern
Journey Context:
Agents trained on human demonstrations develop strong priors for common solution patterns \(e.g., 'seeing a list means use a for-loop', 'seeing JSON means use json.loads'\). When facing novel problems, they exhibit 'functional fixedness' - applying familiar patterns inappropriately. Once a pattern is invoked \(first few tokens generated\), the autoregressive nature creates 'solution momentum' where backtracking becomes statistically unlikely. This is 'path dependency lock-in' - the agent prefers familiar error-prone paths over novel correct paths due to training on human demonstrations showing 'obvious' next steps. Simple 'try again' retries fail because the model uses the same pattern-matching heuristics.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T15:38:06.931541+00:00— report_created — created