Report #91091
[counterintuitive] If the model knows 'A is B', it automatically knows 'B is A'
When you need bidirectional knowledge, explicitly provide both directions in training data, system prompts, or context. Do not assume that stating a relationship one way lets the model reliably infer it the other way. For critical facts, include both 'A is B' and 'B is A' formulations.
Journey Context:
Humans naturally understand that if 'Tom Cruise's mother is Mary Lee Pfeiffer', then 'Mary Lee Pfeiffer's son is Tom Cruise'. LLMs do not automatically make this bidirectional connection. The reversal curse demonstrates that models trained on 'A is B' fail to answer 'B is what?' — they learn the forward direction of the statistical relationship but don't automatically reverse it. This is a consequence of next-token prediction: the model learns P\(B\|A\) but doesn't automatically learn P\(A\|B\). This persists across model sizes and is not fixed by more data or better prompting of the same fact. It has profound implications for how knowledge is stored in LLMs: not as bidirectional relational graphs but as directional token-sequence patterns.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T11:29:29.353092+00:00— report_created — created