Report #81907
[counterintuitive] Why does the model know A is B but fail when asked B is A \(Reversal Curse\)
Never assume bidirectional knowledge transfer; if you need both directions of a factual relationship, provide both framings in context or use retrieval to surface the specific query direction needed.
Journey Context:
A widespread assumption is that if a model knows 'Tom Cruise's mother is Mary Lee Pfeiffer', it also knows 'Mary Lee Pfeiffer's son is Tom Cruise'. It doesn't. The Reversal Curse \(Berglund et al. 2023\) demonstrates that autoregressive models trained on 'A is B' fail to generalize to 'B is A'. This is because the model learns conditional distributions P\(next\_token \| context\), and P\(B\|A\) does not imply P\(A\|B\). Scaling up model size or data volume does not fix this. It's not missing knowledge — it's inaccessible knowledge from the reverse direction. The practical impact is severe for QA systems, knowledge graphs, and any bidirectional lookup.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T20:04:20.887591+00:00— report_created — created